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THE HIPPOCAMPUS AND MOTIVATION" 


LEONARD E. JARRARD * 


Carnegie-M ellon University 


The available literature suggests that several different behavioral processes may 
be affected when the hippocampus is experimentally manipulated. In addition 
to convincing evidence that the structure is involved in human memory, and 
findings from research indicating a role in response perseveration, the evidence 
reviewed in this paper suggests that the hippocampus is involved in motivation. 
Specifically, it is proposed that the hippocampus functions as a negative feed- 
back mechanism serving to inhibit the activity of lower structures more in- 
timately involved in the control of nonspecific levels of activation and pos- 
sibly some types of incentive motivation (thirst and hunger). Evidence is re- 
viewed that supports the view that all regions of hippocampus are not func- 
tionally equal. This functional dissociation is characterized by (a) the antero- 
dorsal CA1 cell fields that project via dorsal and postcommissural fornix to 
hypothalamus and thalamus being involved in incentive motivation, (b) 
CA3-CA4 cell fields that send axons through fimbria to the septal region being 
more concerned with the inhibiting of responses, and (c) all cell fields con- 
tributing to the influence that hippocampus has on nonspecific levels of 


activation. 


for any one theory to the exclusion of the 
others since a variety of effects оп behavior 
have been reported when this limbic structure 
is surgically ablated, or chemically and elec- 
trically stimulated. This is not surprising 
when one considers that damage to hippocam- 
pus has varied widely both in location and 
extent, and implanted electrodes often differ 
in location within hippocampus. Comparisons 
are further complicated by differences in be- 
havioral testing procedures and differences in 
species that have been employed as subjects. 

A careful review of the literature does not 
support the view that the hippocampus is 
concerned with a single function. Rather, 
available neuroanatomical, neurophysiological, 
and behavioral data suggest that several dif- 
ferent behavioral processes may be affected 
when the structure is experimentally altered. 
There is convincing clinical evidence that the 


Many investigators have thought of the 
functions of the hippocampus in terms of a 
unitary process. Several different points of 
view have been proposed: (a) The early for- 
mulation that the structure is concerned with 
the integration of emotional and motivational 
aspects of behavior (Kluver & Вису, 1937; 
Papez, 1937). (5) The hippocampus plays a 
role in the mechanism of attention by inhibit- 
ing the orienting reflex (Grastyan, 1959). 
(c) The structure is primarily concerned with 
memory, more specifically, "recent" memory 
(Meissner, 1966; Milner, 1968; Milner & 
Penfield, 1955). (d) The hippocampus is in- 
volved in inhibiting learned and unlearned 
responses (Douglas, 1967; Kimble, 1968; 
Pribram, 1967). (e) Hippocampal activity is 
closely related to motor responses, especially 
voluntary types of movement (Vanderwolf, 


1971). It has been difficult to obtain support 


1 This research was supported by National Science 
Foundation Grants GB-8044 and GB-30113. 

2 Requests for reprints should be sent to L. E. 
Jarrard, Department of Psychology, Washington and 
Lee University, Lexington, Virginia 24450. 


hippocampus is involved in human memory 
(Milner, 1968), and evidence from research 
suggests a role in response perseveration (see 
the reviews by Douglas [1967| and Kimble 
|1968]). It is the contention of the t 
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writer that the hippocampus also has a sig- 
nificant influence on nonspecific changes in 
activation or arousal and possibly some spe- 
сїйс types of incentive motivation (thirst and 
hunger). In addition, there is increasing evi- 
dence that different regions within hippo- 
campus may be involved in some functions 
more than others. Tt is the purpose of the 
present paper to review the evidence support- 
ing a motivational involvement of hippocam- 
pus and to speculate on the existence of func- 
tional dissociation within the structure. 

Only the neuroanatomical, electrophysio- 
logical, and neuroendocrine details that are 
most relevant to present arguments are pre- 
sented here since several excellent reviews are 
available (Green, 1964; Lorente de No, 1934: 
Stumpf, 1965). Most of the relevant behav- 
ioral data are from studies in which the rat 
has been used as a subject and the lesion tech- 
nique employed to experimentally manipulate 
the hippocampus. 


CONSIDERATIONS 
Neuroanatomical 


The major afferent and efferent connections 
of the hippocampus are with structures known 
to be important in motivation (Crage, 1961; 
Green, 1964; Lorente de No, 1934; Nauta, 
1958; Raisman, Cowan, & Powell, 1966). Be- 
cause of the close anatomical relationship be- 
tween hippocampus and hypothalamus, espe- 
cially medial hypothalamus (Nauta, 1958), 
one might expect the hippocampus to exert 
an important influence on hypothalamic 
mechanisms controlling hunger, thirst, and 
spontaneous activity, Further, there are direct 
hippocampal efferents to the central grey mid- 
brain substance (Nauta, 1958) j 
region (Raisman et al.. 1966). 
| Many of the early studies involved lesion- 
ing fornix with the assumption being that 
most, if not all, hippocampal efferents pass 
through fornix. However, there is neurophys- 
iological evidence for major hippocampotem- 
Poral connections in cat and monkey (Green 
& Adey, 1956; Votaw, 1960). In ‘addition, 
neuroanatomical evidence obtained by the 
Present writer using Cebus monkey and the 
Nauta stain demonstrated the existence of ex- 
tensive efferents from hippocampus to subic- 
ulum and entorhinal cortex (unpublished 


and the septal 


results). Hjorth-Simonsen (1971) has re- 
ported hippocampoentorhinal connections In 
the rat. These connections are apparently 
sparse since Raisman et al. (1966) did not 
find evidence for hippocampoentorhinal path- 
ways using the Nauta stain, and Hjorth- 
Simonsen (1971) found it necessary to facili- 
tate the search by employing preliminary 
operations on 8-day-old rats to deafferent 
hippecampus. Thus, the available evidence 
suggests that there may be important specie 
differences in distribution of hippocamp4 
efferents. E 
It has long been known that the hippo” 
campus can be divided into several сун 
architectonic fields (Lorente de No, 193 н 
1934). In the excellent study by Raisman a 
al. (1966), it was shown with the Nauta € 
that hippocampal cell fields in the rat hey 
differential connections to ехітарірросаши 
structures. They found that fibers n E 
dorsal fornix and the fimbria differ both Е 
distribution and origin. The dorsal fornix E | 
found to receive its fibers primarily from 
anterior part of hippocampal field CA1 K: 
to project via postcommissural fornix tO 
anterior thalamic nuclei and medial а 
lateral mammillary nuclei. Fields CA3 ie 
CA4 distribute their fibers through the fib | 
and precommissural fornix to the jen. 
septal nucleus and have reciprocal connects | 
with the medial septal nucleus and nuc 
of the diagonal band (Raisman, 1966). йй 
posterior part of field СА1 has fibers distr? 


t i abri 
uted through both dorsal fornix and йт, 


s 2 
thus, sending axons to both major areas С 
receive hippocampal efferents. On the ose 
of these data, Raisman et al. (1966) pro a 
a model in which there are two essen ain 
distinct neuronal systems, one correspo” 17% 

closely to the classical limbic system an aff 
cluding the anterior part of field CAL; yh 
the other consisting of fields CA3 and йе” 
medial and lateral septal nuclei, and n" 


of the diagonal band. | 


j 
‚ 


Flectro physiological pa! h 


Several investigators have suggested sd 
the hippocampus inhibits the reticular "m 
vating system (Adey, Segundo, & Livin? д] 
1957; Grastyan, 1959). Bremer, Stoupe ‘ap! 

VanReeth (1960) showed that hippoc | 
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stimulation had an effect on cortical evoked 
potentials which was opposite to that of 
stimulation of the reticular formation, thus 
suggesting a close antagonistic interaction 
between the two brain areas. More recently, 
Redding (1967) demonstrated that the hippo- 
campus has both an inhibitory influence on 
the brainstem reticular formation and a facili- 
tatory effect on nonspecific thalamic systems. 
The relations between hippocampal activity 
and various motivational states have been 
investigated. It was reported by Grastyan, 
Karmos, Vereczkey, and Kellenyi (1966) that 
pull (approach) and push (avoidance) re- 
actions were regularly accompanied by hippo- 
campal theta and desynchronization, respec- 
tively. Ford, Bremner, and Richie (1970) 
noted that electroencephalic activity in the 
hippocampal theta range was greater after 22 
hours than after 3 hours of food deprivation. 
In other research hippocampal theta de- 
creased during repeated exposure to a novel 
environment but was reinstated with both 
food and water deprivation and “blocked” 
during initiation of consummatory responses 
(Routtenberg, 1968). Routtenberg theorized 
that the hippocampus modulates two oscillat- 
ing subcortical systems consisting of the 
reticular formation (activity in this system 
being signaled by hippocampal desynchroni- 
zation) and the limbic system (activity ac- 
companied by hippocampal theta). 


Neuroendocrine 


The hippocampus appears to have an 
important influence on the neuroendocrine 
system and thus homeostatic regulation of 
physiological processes. Neuroanatomical pro- 
jections from hippocampus to the arcuate 
nucleus of the hypothalamus have been de- 
scribed (Nauta, 1956), and this cell group 
is involved in hormonal control. Mason 
(1958) studied the influence of various limbic 
structures on pituitary-adrenal activity and 
reported that destruction of hippocampus in- 
terrupted the usual diurnal rhythm of 17- 
hydroxycorticosteroids seen in normal mon- 
keys. While urinary levels of this steroid 
normally fall during the night, the levels re- 
mained elevated in monkeys with hippocam- 
pal lesions. Other investigators (Endroczi & 
Lissak, 1960; Feldman, Wajshort, & Bern- 


baum, 1967; Knigge, 1961) looked at ad- 
renalcorticosteroid and potassium output of 
the gastric secretory cells and concluded that 
the hippocampus has an inhibitory effect on 
secretion of these substances. In 1964 Green 
summarized the available evidence and stated 
that the hippocampus is inhibitory for the 
release of adrenocorticotropic hormone and 
possibly excitatory for the release of thyro- 
tropic hormone (Green, 1964). In view of the 
increases in pituitary-adrenal cortical activity 
during conditions of arousal and alerting 
(Mason, 1968), and the normal inhibitory 
effect of hippocampus on this system, experi- 
mental manipulations of hippocampus should 
have an effect on behaviors that are influenced 
by endocrine system functioning. 


Basic MOTIVATIONAL PROCESSES 
General Activity 

One change in behavior frequently reported 
by investigators following hippocampal lesions 
in rats is an increase in general activity 
(Douglas & Isaacson, 1964; Jarrard & Bun- 
nell, 1968; Kimble, 1963; Nadel, 1968; 
Sengstake, 1968; Teitelbaum & Milner, 
1963). However, not all investigators have 
found significant increases (Boitano, Lubar, 
Auer, & Furnald, 1968; Douglas & Isaacson, 
1964; Kaada, Rasmussen, & Kveim, 1961; 
Kim, 1960). These conflicting results are not 
surprising when one considers the multiple 
variations in activity-measuring devices and 
procedures that have been employed (see 
Strong & Jackson, 1970), and differences in 
size and location of the lesions. Another com- 
plicating factor is that most investigators have 
looked at activity for relatively short periods 
of time in novel situations. Animals with hip- 
pocampal lesions are often reported to habitu- 
ate more slowly than controls (Jarrard & 
Korn, 1969; Leaton, 1965; Roberts, Dember, 
& Brodwick, 1962), and this has been inter- 
preted as a deficit in response inhibition 
(Kimble, 1968). It is apparent that differ- 
ences in general activity, habituation, and ex- 
ploration would be confounded in these other 
studies. 

Behavior in the home cage and novel open 
field has been studied by Jarrard (1968). 
Groups did not differ in the home cage during 
the day, but hippocampectomized rats were 
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more active at night. (Similar results have 
been reported by Kim, Choi, Kim, Chang, 
Park, & Kang [1970].) In the open feld, 
hippocampals were more active than con- 
trols but habituated at the same rate. In 
a replication and extension of this research, 
Jarrard and Simsak? found that the differ- 
ential effect on day-night activity persisted 4 
months after the operations.’ Further, with 
continuous white light the usual 24-hour 
circadian rhythm increased the same as in 
control subjects (to approximately а 255-hour 
rhythm), and hippocampals readjusted to the 
usual 12-hour-day-12-hour-night lighting reg- 
imen as quickly as controls. However, hippo- 
campals were still more active during the 
time that control subjects were more active. 
Thus, it would appear that rats with lesions 
in hippocampus are more affected than con- 
trols by the novel, external stimuli found in 
the open field and by whatever the internal 
stimuli are that cause rats to be more active 
at night than during the day. 

Several investigators have recorded activity 
of rats with hippocampal lesions under differ- 
ent deprivation conditions (Gotsick, 1969; 
Niki, 1962; Sengstake, 1968). Although hip- 
pocampals were usually more active than con- 
trol subjects, the lack of differential effects 
with increasing deprivation agrees with the 
findings reported for normal rats (Bolles, 
1967), Related is a study by Schmaltz and 
Isaacson (1967) in which hippocampals were 
found to be more active than controls before 
being tested in a bar-pressing task but not 
after eating the daily food ration. 

When activity is recorded in situations that 
evoke goal-directed behavior, rats with hip- 
pocampal lesions are frequently more active 
than control subjects. Thus, increased inter- 
trial activity during development of avoidance 


. * L. E. Jarrard & M. A. Simsak. Changes in activ- 
ity, eating, and drinking following lesions of the 
hippocampus. Paper presented at the meeting of the 
Psychonomic Society, Saint Louis, October 1968. 

t The persisting hyperactivity in the home cage 
probably accounts for the lighter body weight and 
higher levels of liver glycogen recently reported to 
exist in rats with hippocampal damage 8 months 
following the operation. H. M. Murphy, C. Wideman 
Bes Brown. Liver glycogen levels in rats vit 
о lesions. Paper presented at the meeting of the 

ychonomic Society, San Antonio, October 1970. 


in the two-way shuttle box was noted by 
Green, Beatty, and Schwartzbaum (1967) for 
subjects with large bilateral hippocampal 
lesions but not subjects with less extensive 
damage. Kaplan (1968) reported no differ 
ences in acquisition of a conditioned freezing 
response, but during extinction subjects with 
hippocampal lesions were less perseverative 
than normal subjects, that is, they were more 
active. 1 | 
Grant and Jarrard (1968) employed chemi- 
cal stimulation of different hippocampal сё | 
fields in rats and looked at activity; eating 
and drinking. Stimulation with carbachol at 
all locations within hippocampus resulted M 
pronounced increases in activity with осо 
sional motor seizures. Similar results ha 
been reported by MacLean (1957) with elec 
trical stimulation, Hyperactivity and sni wi 
resulted from low voltage stimulation, M 
motor seizures and catatonic responses Ww 
curred during hippocampal afterdischaré 
following higher voltage stimulation. Gras | 
(1959) did not report hyperactivity 1n mi 
with low-level stimulation, but noted Я 
stimulation was followed by rebound oF te 
lease effects that were sufficient to active. 
previously inhibited conditioned respons (8 
Since the hippocampus is particularly suse (ў 
tible to epileptiform activity (Green, 1 nd 
the increases in general activity during n 
following stimulation may have been 
sult of seizural discharges. 


Reactivity to Stimuli | 


m 
While it is not possible to know the ex 
to which the increases in activity desc?! p 
above may be the result of increases de 
activity, there have been several studie? i 
signed to look specifically at how animals jul 
hippocampal lesions react to external sU as 
Hippocampal lesions do not result in ind? 0] 
in reactivity to presentation of either | n 
(Green et al., 1967; Kaplan, 1968) О pt 
tory (Gotsick, 1969) stimuli unless the е: <0" 
lation is intense (see Ireland & oq? p 
1968). Kamback (1967) found 510%! y 
uation in hippocampals bar pressin? роб, 
stimulus change under both ad-lib an E 
deprivation conditions. Because of t NM 
larity in behavior of satiated hippo a 
and food-deprived control subjects; ® 
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was drawn between the effects of hippocampal 
lesions and the effects of food deprivation on 
habituation to novel stimulation. 

In several studies reactivity to stimuli has 
been studied when the subjects are engaged 
in other ongoing behaviors. Thus, novel stim- 
uli have been presented during drinking 
(Hendrickson, Kimble, & Kimble, 1969), bar- 
pressing (Kaplan, 1968), and performance in 
a straightway (Raphelson, Isaacson, & Doug- 
las, 1965; Riddell, Rothblat, & Wilson, 1969; 
Wickelgren & Isaacson, 1963). Crowne and 
Riddell (1969) measured the cardiac compo- 
nent of orienting and found that hippocampal 
rats failed to show a normal response to novel 
stimuli when they were attending to other 
stimuli. The results of these studies show that 
subjects with hippocampal lesions are less 
distracted by extraneous stimulation when the 
subjects are engaged in goal-directed activity. 

Several investigators have been concerned 
with the possibility that subjects are more 
affected by noxious stimuli following lesions 
of hippocampus. This is especially important 
since avoidance from shock has been used in 
many experiments. Although it has been re- 
ported that threshold for reaction to shock 
does not usually change, hippocampals over- 
react to shock intensities that are well above 
threshold (Evans & Jarrard, unpublished re- 
sults; Schlosberg, Johnson, & Lash °). Blanch- 
ard and Fial (1968) determined shock 
thresholds for jump, flinch, vocalization, and 
crouching in rats with hippocampal lesions. 
While there were slight but significant differ- 
ences in thresholds (hippocampals jumped at 
a lower shock level and vocalized at higher 
levels), the most important finding was de- 
creased crouching following shock. With 
higher levels of shock, heart rate has been 
observed to increase in hippocampals follow- 
ing a single shock while the same conditions 
cause a heart rate decrease in control animals 
(Jarrard & Korn, 1969). McNew and Thomp- 
son (1966) observed that hippocampal sub- 
jects reacted more vigorously to shock during 
acquisition of an avoidance response. Thus, 
from the available evidence it would appear 


SA. J. Schlosberg, C. N. Johnson, & L. Lash. 
Response to shock by hippocampectomized rats. 
Paper presented at the meeting of the Eastern Psy- 
chological Association, Philadelphia, April 1969. 


that differences in shock threshold are small 
but subjects with hippocampal lesions react 
more than controls to shock levels that are 
above threshold. 


Drinking and Eating Behaviors 


The hippocampus seems to have an influ- 
ence on drinking and eating. Significant in- 
creases in ad-lib drinking have been reported 
following lesions of hippocampus (Glickman, 
Higgins, & Isaacson, 1970; Kimble & Coover, 
1966); however, other investigators found 
no change in water consumption (Boitano et 
al., 1968). No differences have been found 
either in amount of water consumed or rate 
of drinking following 48 hours of deprivation 
(Gotsick, 1969; Kaada et al, 1961). Fur- 


‘ther, Milgram (1969b) was unable to elicit 


drinking of water following electrical stimula- 
tion of dorsolateral hippocampus, but he did 
find that the rats would drink a liquid diet 
and a saccharin solution. Perhaps the most 
consistent finding relating to water consump- 
tion is induction of drinking following chemi- 
cal stimulation of hippocampus (MacPhail, 
1968; Mountíord, 1969). Fisher and Coury 
(1962) elicited drinking in rats with car- 
bachol stimulation of dorsomedial hippocam- 
pus. More recently, Grant and Jarrard (1968) 
obtained drinking with carbachol stimulation 
of anterior CA1 hippocampal cell fields but 
not CA3-CA4 cells. 

Rats with hippocampal lesions do пої 
usually consume more food than control ani- 
mals (Boitano et al, 1968; Gotsick, 1969; 
Jarrard, 1965; Niki, 1962), but significant 
increases have been reported (Kimble & 
Coover, 1966; Wishart, Brohman, & Mogen- 
son, 1969). While there were no differences 
in total amount of food consumed in the study 
by Jarrard and Simsak (see Footnote 3), the 
pattern of eating at night changed from eating 
in “meals” (where a meal is defined as con- 
tinuous eating for at least 2 minutes) to sig- 
nificantly shorter periods of eating interrupted 
by periods of activity. 

Milgram (1969a) obtained changes in feed- 
ing behavior following electrical stimulation 
of dorsolateral hippocampus. Stimulated rats 
would eat, perform a food-motivated response, 
and learn to press a bar for food. It is inter- 
esting that evidence for facilitated feeding 
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was not obtained during but only following 
termination of the stimulation. Thus, it was 
suggested that the hippocampus may normally 
inhibit feeding, and the facilitation was a 
result of rebound excitation of another struc- 
ture. Attempts to switch the rebound feeding 
to drinking of water, gnawing, and mouse 
killing were not successful (Milgram, 1969b). 

Increases in eating have been reported 
following chemical stimulation of hippocam- 
pus. Coury (1967) obtained eating in rats 
with norepinephrine stimulation of dorso- 
medial hippocampus but not anterior and 
lateral hippocampal areas. More recently, 
Grant and Jarrard (1968) found that stimu- 
lation with either norepinephrine or carbachol 
at all hippocampal cell fields increased food 
consumption over sham stimulation levels. 
The explanation for these discrepant results 
is not apparent. 


Choice, Preference, and Competition 


There do not appear to be any significant 
changes in simple and preferential consum- 
matory behavior following damage to hippo- 
campus. Beatty and Schwartzbaum (1968) 
and the present writer (unpublished results) 
found that rats with hippocampal lesions 
failed to exaggerate consummatory reactivity 
to positive (sucrose and saccharin) and nega- 
tive (quinine) properties of solutions. Al- 
though changes in consummatory behavior 
were reported by Murphy and Brown (1970) 
following lesions of hippocampus, the authors 
considered the changes to be secondary to 
observed increases in activity. 

Jarrard ° employed a paired-comparison 
procedure that forced subjects to choose be- 
tween being in a compartment with food, with 
another rat, or in a novel situation. Food- 
deprived rats with hippocampal damage spent 


more time with the food and were more active 
than control subjects. 


MOTIVATION AND LEARNING 

A problem frequently overlooked in experi- 
mentally testing the hypothesis that the hip- 
pocampus is primarily concerned with re- 
а алаас 
th of preference for food in 
phalic lesions. Paper presented at 
e Southern Society for Philosophy 


and Psychology, Atlanta, March 1963. 


sponse inhibition or learning and memory 1S 
that differences in motivation (activity, hun- 
ger, thirst, reactions to novel and noxious 
stimuli) can result in differences in learning 
or performance of a learned task. One might 
expect facilitated or impaired performance to 
the extent that hippocampectomy results 17 
animals emitting responses that are com 
patible or incompatible with learning the task. 
Differences in drive do not usually affect 
acquisition of simple tasks in normal animals, 
but complex learning is affected (Berlyne 
1968; Bindra, 1959). In rats with damage 
to hippocampus there is usually no impatt- 
ment in acquisition of simple tasks, including 
straight runway (Jarrard, Isaacson, & Wickel- 
gren, 1964; Niki, 1962), bar pressing on 4 
continuous reinforcement schedule for foo 
or water (Jarrard, 1965; Schmaltz & Isaac 
son, 1967), and simple visual and spatia 
discriminations (Kimble, 1963; pers 
Kimble, 1965). Musty (1969) did not "n 
differences in a simple avoidance task, d 
hippocampals were impaired in acquisition i 
a more complex pattern discrimination. Mi 
cits in hippocampectomized rats are иша 
found in acquisition of complex tasks sv 
as mazes (Jarrard & Lewis, 1967; Kaada ıı 
al., 1961; Thomas, 1971), and monkeys w 
hippocampal lesions have trouble lear! 
sequential responses (Kimble & Pribra 
1963) and delayed alternation (Rosvol4 
Szwarcbart, 1964). ۳ 
Subjects with hippocampal Jesions ha 
been reported to press a lever at higher u^ 
than control rats with “moderate” level J$ 
deprivation but not at higher or lower eg) 
(Musty, 1969). Kimble and Greene (19 ;n 
found little evidence for latent learning uy 
hippocampectomized subjects. One would ed 
pect these results with highly moti jf 
subjects since high drive has the effec. eg 
narrowing the range of cues to which sub) $51 
respond (Bruner, Matter, Papanek, oni) 
Easterbrook, 1959; Telegdy & Cohen, 1 “tl 
and amount of latent learning is met 3): 
related to strength of drive (Johnson, e al 
Reports of less distractibility in hippoc® iss 
subjects (Raphelson, Isaacson, & Dor a 
1965; Wickelgren & Isaacson, 1963) il 
be due to a narrowing of the range of | 
to which highly motivated subjects atte? 


ning 
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Since hippocampals react more than con- 
trols to above-threshold levels of shock (see 
above), one might expect differences in learn- 
ing and performance when shock avoidance 
is used as the incentive. Blanchard and Fial 
(1968) suggested that an observed decreased 
crouching following shock may account for 
the often-reported passive avoidance deficit 
characteristic of rats with hippocampal le- 
sions. Musty (1969) found an impairment of 
hippocampal subjects in pattern discrimina- 
tion with high shock levels but not when low 
shock was employed. In a related unpublished 
study the present investigator found only a 


slight impairment in hippocampectomized 


subjects in reversal of a Y-maze position 
habit for food but a severe impairment when 
shock avoidance was used. 

Several investigators have studied the in- 
volvement of hippocampus in learning by 
employing stimulation techniques. Correll 
(1957) used subthreshold electrical stimula- 
tion of hippocampus in cats while Grossman 
and Mountford (1964) induced spreading 
depression in hippocampus of rats during 
learning. The results of both studies were 
interpreted as suggesting that interference 
with hippocampal functioning may affect per- 
formance variables such as motivation rather 
than associative mechanisms related to ac- 
quisition or retention. 

It is apparent from the above studies that 
a motivational explanation must be ruled out 
before impaired (or facilitated) performance 
of animals with hippocampal lesions can be 
attributed to deficits in other mechanisms. 


FUNCTIONAL DISSOCIATION WITHIN 
HIPPOCAMPUS 


One of the most important questions now 
facing investigators studying hippocampus 15 
whether the complex structure is homogeneous 
in function with all parts being equally im- 
Dortant in influencing behavior, or whether 
there are functional differences associated 
With known neuroanatomical differences. It 
has been assumed by many, Spay pom 
working primarily at a behavioral leve owe 
the hippocampus functions a5 а unit. i » 
Size and location of a lesion (or stimulating 
electrode) within hippocampus are ap 
ally seen as being important. Many о 


contradictions found in the current literature 
would probably be resolved if it could be 
shown that functional dissociation does exist. 
Neuroanatomical (Nauta, 1956; Raisman 
et al., 1966), neurophysiological (Adey, 1961; 
Brazier, 1970; Elul, 1964; MacLean, 1959; 
Votaw & Lauer, 1963), and neurochemical 
(Purpura, 1959) evidence suggests that the 
hippocampus has a topical organization. As 
previously described, Raisman et al. (1966) 
proposed a model in which there are two 
neuronal systems—one primarily involving 
anterior CAI cell fields and the second fields 
САЗ and CA4. The main problem in attempt- 
ing to functionally test the model is the 
complex manner in which the various hippo- 
campal fields are arranged. However, the rat 
is a good subject to use in attempts to lesion 
or stimulate selected fields since CA1 is an 
outermost layer, and the majority of fields 
САЗ and CA4 are located in the caudal half 
of the hippocampus (see Raisman, 1966). 
Although lesions to anterodorsal or postero- 
ventral regions would not be limited exclu- 
sively to sites of origin of the two major 
projections, with these selective lesions there 
should be differences in the extent to which 
the two main projections are involved. 
Several experiments have been reported in 
which lesions were limited to different parts 
of hippocampus (Jackson, 1968). In one of 
the earliest, Kimura (1958) damaged either 
the anterior or posterior hippocampus of rats 
and reported that posterior lesions interfered 
with acquisition of a passive avoidance re- 
sponse while anterior lesions had no effect. 
Rabe and Haddad (1969) and Kaada et al. 
(1962) found no evidence for differential 
effects of dorsal and ventral lesions on acquisi- 
tion of two-way and passive avoidance re- 
sponses, respectively. Nadel (1968) reported 
that ventral lesions affected rate of habitua- 
tion of exploratory responses, retarded extinc- 
tion of a conditioned emotional response, and 
facilitated acquisition of a one-way active 
avoidance response. Dorsal hippocampal 
lesions did not affect rates of habituation, but 
did facilitate acquisition of a conditioned emo- 
tional response, and retarded acquisition of a 
one-way active avoidance response. Further, 
rats with either dorsal or ventral hippocampal 
lesions were hyperactive in a novel situation. 
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There have been several attempts to study 
hippocampal functions by selectively damag- 
ing pathways associated with the structure. 
Liss (1968) limited lesions to either lateral 
fornix or posteroventral portions of hippo- 
campus and found impaired one-way active 
and passive avoidance, and facilitated shuttle- 
box learning. Since the midline fornix re- 
mained intact, the author felt that disruption 
was primarily in fibers from posteroventral 
hippocampus and not the anterodorsal region. 
MacDougall, Van Hoesen, and Mitchell 
(1969) compared large fornix lesions that 
disrupted both dorsal fornix and fimbria 
fbers with smaller lesions restricted to the 
medial aspects of fornix. Marked deficits in 
performance of an operant response (differ- 
ential reinforcement for low rates of respond- 
ing) were found in subjects with complete 
lesions of fornix but not in subjects with 
limited dorsal fornix lesions. The authors sug- 
gested that the septal connections with hippo- 
campal fields CA3-CA4 via fimbria are part 
of a system important in response inhibition 
and timing behavior. More recently, Thomas 
(1971) compared subtotal hippocampal le- 
sions with fornix lesions and found that both 
groups of subjects were significantly impaired 
in retention and savings in relearning a com- 
plex maze. Damage to fornix, while not 
limited to postcommissural fornix, was more 
extensive in the descending columns. 

It is possible that more precise experimental 
manipulations of the different hippocampal 
cell fields can be obtained with chemical and 
electrical stimulation. Grant and Jarrard 
(1968) studied the effects of adrenergic and 
cholinergic stimulation of anterodorsal and 
posteroventral CA1 and CA3-CA4 cell fields 
on drinking, eating, and activity. Control of 
drinking was found to be dissociated both on 
à neuroanatomical (limited to anterodorsal 
CAI cell field) and biochemical (limited to 
cholinergic stimulation) basis. However, neu- 
Toanatomical location was not important in 
the effects on eating and activity. Interest- 
ingly enough, subjects were more active fol- 
lowing carbachol stimulation at all hippo- 
campal placements, thus suggesting a general 
Increase in level of activation with no dis- 
sociation. Milgram (1969a, 1969b) reported 
significant increases in feeding behavior with 


electrical stimulation of lateral dorsal hippo- 
campus. The effect was less pronounced or 
absent with stimulation of other hippocampal 
areas. The reason їог the apparent conflict 
that exists between the effects of lesions and 
of chemical and electrical stimulation of 
selected hippocampal cell fields is not clear. 

Most of the available evidence on func- 
tional dissociation in hippocampus has been 
obtained in the rat. However, there are several 
studies suggesting that different hippocampal 
regions in the cat are not functionally equal. 
Andy, Peeler, and Foshee (1967) found that 
cats with complete lesions and those with 
lesions limited to dorsal hippocampus were 
impaired in retention of an avoidance re- 
sponse, whereas subjects with ventral Jesigi 
were similar to controls. Similar effects E 
fornix and entorhinal lesions on learning anc 
retention were reported by Uretsky ^ 
McLeary (1969). Differential effects w! 
electrical stimulation of dorsal and ks 
regions have been reported on attack behavi f 
(Siegel & Flynn, 1968), and changes in w 
pressure have been reported in cats (AP У, 
Webster, Mukawa, & Bonn, 1961) and s 
keys (Votaw & Lauer, 1963). Further, ver 
hippocampal regions in the cat differ in ter 
of electrical activity (Adey, 1961; MacLe2 
1959). It is difficult to compare the results © 
the above investigations with those in wh!¢ 
cats and those in which rats have been b 
as subjects since comparable data on pro 
tions of the various hippocampal cell fie B 
do not exist for the two species. Even m 
important is the possible existence of sae 
hippocampal entorhinal projections in the £ 
(see above). 


CONCLUSIONS 


as 
It would appear that the hippocampus va 
an important influence on several behav” in 
processes. In addition to being importa ^... 
response perseveration in the rat (Dod 
1967; Kimble, 1968), the evidence revi; 
in this paper suggests that the һїрросалЁ g, 
is intimately involved in motivation. Sp 
cally, it is proposed here that the hippo р 
pus functions as а negative feedback үе wer 
anism serving to inhibit the activity of on 
Structures with which it is reciprocally en 
nected. Especially important is the influ 
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that the hippocampus has on structures con- 
trolling nonspecific levels of activation or 
arousal. It is hypothesized that experimental 
manipulations of hippocampus (lesions, chem- 
ical and electrical stimulation) result in non- 
specific changes in activation or general drive 
that affect the way the subject reacts to the 
environment. Thus, following damage to hip- 
pocampus, level of activation or drive should 
be increased, “general motor readiness" 
(Bindra, 1968) should be enhanced, and 
under conditions of low stimulation the sub- 
ject should be more responsive to both ex- 
ternal and internal stimuli. Performance in 
learning tasks (especially complex tasks) 
should be affected since high levels of activa- 
tion or drive serve to narrow the range of 
cues used by subjects (Easterbrook, 1959; 
Telegdy & Cohen, 1971) and complex tasks 
usually require difficult sensory discrimi- 
nations. 

While there is evidence to suggest that the 
hippocampus may also be involved in the 
modulation of some specific types of drive 
or incentive motivation (especially drinking 
and eating), the exact nature of the involve- 
ment is not clear. Conflicting results have 
been obtained concerning the effects of lesions 
on water and food consumption, but there are 
changes in the pattern of drinking and eating 
following lesions of hippocampus, and con- 
sistent increases in consumption are reported 
with chemical stimulation. 

The neuroanatomical, neurophysiological, 
and behavioral evidence reviewed above would 
appear to support the hypothesis that there 
is functional dissociation within hippocampus 
in the rat. Specifically, the anterodorsal CA1 
cell field that projects via dorsal and post- 
commissural fornix to hypothalamus and an- 
terior nucleus of the thalamus seems to be 
involved in incentive motivation. Thus, ex- 
Perimental manipulations of this area have 
been reported to affect drinking (Grant & 
Jarrard, 1968) and eating (Milgram, 1969a, 
1969b). The CA3-CA4 hippocampal cell fields 
that send axons through the fimbria to Ше 
Septal area and diagonal band nuclei are more 
concerned with the inhibiting of responses 1, 
Tesponse perseveration (MacDougall et ч i 
1969), The evidence further suggests tha 
both areas may have some functions in com- 
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mon, for example, the involvement in general 
arousal (Grant & Jarrard, 1968; Nadel, 
1968). Thus, it is possible that all cell fields 
contribute to the influence that the hippo- 
campus has on the undifferentiated state of 
activation or arousal. 

The above formulations provide a basis for 
predictions concerning the behavioral effects 
of experimentally manipulating the hippocam- 
pus. Confirmation or refutation of these pre- 
dictions should contribute to the development 
of a more comprehensive theory of hippocam- 
pal function. 
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HYPOTHALAMIC FEEDING MECHANISMS AND 
AMPHETAMINE ANOREXIA * 


SHERWOOD O. COLE ? 


Rutgers—The State University, Camden 


The hypothalamic regulation of feeding is discussed in terms of the ventro- 
medial hypothalamic “satiety center" and the lateral hypothalamic “feeding 
center.” Two hypotheses concerning the anorexic properties of amphetamine 
are considered in terms of the drug’s interaction with hypothalamic centers—a 
ventromedial-hypothalamic-mimicking hypothesis and a lateral-hypothalamic- 
blocking hypothesis. A consideration of these hypotheses suggests that the drug 
influences both centers, and that additional sites in the cerebral cortex and 
brainstem also may serve ancillary functions in mediating the drug's effect on 
feeding. It is suggested that future research should attempt to clarify the 
mediational processes underlying the drug's effect on feeding and nonfeeding 
performance and, in those instances where such processes overlap, to specify 


the nature of the overlap. 


The general depressant effect of ampheta- 
mine on feeding and food-motivated behavior 
is well established (Cole, 1967, 1970). Carl- 
ton (1963) has suggested that amphetamine 
anorexia may be due to an increase in activa- 
tion to the degree that responses incompatible 
with those required in feeding occur and thus 
result in a decrease in feeding. A second pe- 
ripheral view of the action of amphetamine 
on feeding (Mogenson, 1968) suggests that 
the drug reduces liver and muscle glycogen 
and thereby increases the intracellular glucose 
in hepatic cells that causes gluco-receptors in 
the liver to signal a state of satiety. Such 
peripheral interpretations must be given 
proper consideration, but they do not explain 
the changing effectiveness of amphetamine 
on feeding following central nervous system 
lesions (Cole, 1967, 1968). Furthermore, am- 
phetamine has been observed to depress feed- 
ing in the complete absence of peripheral 
changes in the circulatory system (Nathan- 
son, 1937) and after interruption of nerve 
pathways regulating the gastro-intestinal 
tract (Harris, Ivy, & Searle, 1947). 
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The purpose of the present paper is (a) 
to review briefly the hypothalamic mecha- 
nisms regulating feeding and (b) to consider 
the anorexic action of amphetamine in terms 
of the drug's interaction with such mecha- 
nisms. 


HYPOTHALAMIC FEEDING MECHANISMS * 


Two specific regions of the hypothalamus 
are of major importance in the regulation of 
feeding—the ventromedial and lateral regions. 
'These regions have been referred to as the 
“satiety center" and “feeding center" of the 
hypothalamus, respectively (Anand & Bro- 
beck, 1951). 


Ventromedial Hypothalamic Satiety Center * 


Brobeck, Tepperman, and Long (1943) 
demonstrated that electrolytic lesions in the 
ventromedial hypothalamus produced an in- 
crease in feeding by rats. Several subsequent 
studies (Brooks, Lockwood, & Wiggins, 1946; 
Liebelt & Perry, 1957; Marshall, Barrett, & 


3 For an updated review on the hypothalamic con- 
trol of feeding that appeared during the preparation 
of this paper, see Hoebel (1971). 

4 Reynolds (1963, 1965) has questioned the exis- 
tence of a specific ventromedial hypothalamic “satiety 
center” and has hypothesized that electrolytic lesions 
in this area deposit metallic ions that produce scar 
tissue and have irritating properties to adjacent lat- 
eral hypothalamic areas. However, there is evidence 
that hyperphagia can be produced by destruction of 
the ventromedial hypothalamus in the absence of ir- 
ative effects (Gold, 1970a; Hoebel, 1965). 


rit 
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Mayer, 1955; Wagner & de Groot, 1963) 
supported these findings with bilateral lesions 
of the ventromedial hypothalamus, either elec- 
trolytically or chemically introduced, enor- 
mously exaggerating the feeding of experi- 
mental animals (hypothalamic hyperphagia). 
Such a lack of inhibition to eat with ventro- 
medial hypothalamic lesions apparently in- 
volved a failure to adjust food intake to the 
energy demands of the body (Brobeck, 1946; 
Strominger, Brobeck, & Cort, 1953). Recent 
evidence suggests that although both male 
and female rats demonstrate an acute dy- 
namic period immediately following surgery, 
followed by a more chronic dynamic period 
(Balagura & Devenport, 1970), the actual 
amount of hyperphagia resulting from ventro- 
medial hypothalamic lesions may vary with 
sex (Balagura & Devenport, 1970; Cox, Ka- 
kolewski, & Valenstein, 1969). However, such 
differences may be due to a failure to equate 
sexes for weight (Gold, 1970b) or to a failure 
to take into consideration the influence of 
day-night differences in sexual adjustment 
(Balagura & Devenport, 1970). Although 
Studies by Hetherington (1943, 1944) sug- 
gested that the critical region involved in the 
production of hyperphagia is the ventromedial 
nucleus, Graff and Stellar (1962) observed 
hyperphagia following bilateral hypothalamic 
lesions which spared the ventromedial nucleus 
but destroyed the area just posterior to it. 
Evidence for the satiety function of the 
ventromedial hypothalamus also was found 
by Wyrwicka and Dobrzecka (1960), with 
electrical stimulation of the ventromedial 
hypothalamus inhibiting the food intake of 
experimental subjects, 


Lateral. Hypothalamic Feeding Center 


In a study by Delgado and Anand (1953), 
electrical stimulation of the nucleus of the 
lateral hypothalamus by means of chronically 
implanted electrodes Produced an increase in 
food consumption: however, this increase was 
observed only after several days of da 1. 
hour stimulation periods, 
(Devor, Wise, 


aily 1- 
Recent evidence 
Hoebel, 1970) 
lor elicited by elec. 
А rigid nor undi- 
"Ubporting the view that lateral 
stimulation can trigg 


ger neural 
regulate feeding. Additional 


hypothalamic 
substrates tha 


support for the facilitatory role of the lateral 
hypothalamus in regulating feeding was found 
in studies by Teitelbaum and Stellar (1954), 
by Anand and Brobeck (1951), and by Feld- 
man, Larsson, Dimick, and Lepkovsky (1957) 
where bilateral lesions of the lateral hypo- 
thalamus resulted in either hypophagia ог 
aphagia. However, there has been some ques- 
tion as to whether this deficit is motivational 
(Rodgers, Epstein, & Teitelbaum, 1965) ог 
due to a disruption of motor systems 1n- 
volved in feeding (Morrison, 1968). On a 
few occasions, experimental subjects made 
hypophagic or aphagic by lateral hypothalamic 
lesions have been observed to recover eating 
spontaneously, but such subjects usually 
starve to death or recover only with special 
diets (Teitelbaum & Stellar, 1954; Williams 
& Teitelbaum, 1959), Teitelbaum and Ep- 
stein (1962) have reported a comprehensive 
study of the recovery stages of feeding fol- 
lowing lateral hypothalamic lesions. Recent 
evidence also suggests that the postoperative 
recovery of lateral-hypothalamus-lesioned 
subjects depends upon preoperative feeding 
experience (Di Cara, 1970) and preoperative 
body weight (Powley & Keesey, 1970), and 
that there may be a sex difference in the 
recovery from such lesions (Cox & Kakolew 
ski, 1970). 


Additional Considerations 


Anand and Brobeck (1951) demonstrated 
that bilateral lesions involving the ventro” 
medial hypothalamic region or the region E 
tween it and the lateral region (but not 
volving the lateral region) produced hype 
phagia and obesity, This hyperphagia ges 
when the lateral hypothalamic regions WÊ d 
destroyed bilaterally. The authors interpret 
the findings to indicate that the ventromed! g 
hypothalamic region is capable of exert 07 
its inhibitory influence over the lateral hye s 
thalamic facilitatory activity through ". 
that run laterally into the lateral re? 
Other evidence by Albert and Storlien (1 jet 
by Gold (1970a), and by Агеев and №: e 
(1967) suggest that fibers do interact po” 
tween the ventromedial and the lateral P? 
thalamus in the regulation of feeding: ie 
dence also suggests that the hypoth? 101 
, centers” interact with other cortical 7 aum 
m the regulation of feeding. Kirsc? 
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id observed some marked changes in the 
Eer and eating of patients with various 
lypes of organic lesions of the cerebral cor- 
tex. Also, Pribram and Bagshaw (1953) ob- 
Served changes in feeding following ablation 
^i the temporal polar-amygdaloid cortex, and 
Tee Rosvold, and Pribram (1957) ob- 
pu striking increases in food intake fol- 
‘ ing amygdalectomy. Recent evidence sug- 
E that changes in feeding following dam- 
gd o temporal lobe sites may be secondary 
"e E rather than due to incentive shifts, 
e е such sites have their influence on 
dels ng by a modulation of the activity of 
1959. the hypothalamus (Murphy & Brown, 
Whit Sclafani, Belluzzi, & Grossman, 1970; 
re x & Fisher, 1969). Other evidence based 
Df Ih e similar effects of chemical stimulation 
e amygdala and hypothalamus on feeding 
(Grossman, 1960, 1964) is consistent with the 
lew that these sites interact in the regulation 
of feeding, 


HYPOTHALAMIC MECHANISMS UNDERLYING 
ACTION OF AMPHETAMINE 


i basic hypotheses concerning the mech- 
pss ms underlying the depressant effects of 

Nphetamine on feeding have been suggested 
Y experimental findings—a ventromedial- 
ec oe hypothesis and a 
Men a -hypothalamus-blocking hypothesis. An 
iter is made to evaluate the merits and 

itations of these hypotheses in handling 


the experimental data. 


Ventromedial Hypothalamus-Mimicking 
Hypothesis ` 
In a study by Brobeck, Larsson, and Reyes 
imo, electrical recordings were taken in 
res rent subcortical regions of the brain in 
уи ќо amphetamine injected intrave- 
e in anesthetized cats. Since the ventro- 
ini ial hypothalamic region responded to the 
jections with increased electrical activity, 
e results suggested that amphetamine de- 
Pressed eating by chemically mimicking the 
etd of the ventromedial Po 
Bi center,” Моге recently, 3 
Indra, and Campbell (1969) found that am- 
eo increases the firing rate of single 
ie units in the ventromedial hypothala- 
Stet and Carr and Moore (1969) demon- 
ated that amphetamine releases norepi- 


nephrine from the ventromedial hypothala- 
mus. Since Foxwell, Funderburk, and Ward 
(1969) found that another anorexic agent 
(fenfluramine) increases electrical activity in 
the ventromedial hypothalamus, and Wade 
and Zucker (1970) demonstrated that an 
estrogen stimulation of the ventromedial hy- 
pothalamus also decreases feeding, an applica- 
tion of the ventromedial-hypothalamus-mim- 
icking hypothesis apparently is not ampheta- 
mine specific. 

One basic assumption of the ventromedial- 
hypothalamus-mimicking hypothesis is that 
if the ventromedial hypothalamic region is 
destroyed by lesioning, the depressant effects 
of amphetamine on feeding should be at- 
tenuated or eliminated. A study by Sharp, 
Neilson, and Porter (1962), in which ventro- 
medial hypothalamic lesions attenuated the 
depressant action of amphetamine on feeding 
in cats, is consistent with this assumption. 
However, other studies (Epstein, 1959; Reyn- 
olds, 1959) have found quite the opposite 
effect; that is, the lesions exaggerated the 
depressant effect of the drug on feeding? 
Although these latter findings suggest the 
need for rejecting the ventromedial-hypothal- 
amus-mimicking hypothesis, a consideration 


of additional facts might still allow one to 
у with some qualifications. 


accept such a viev 
Margules (1969, 1970) has suggested that 


the exaggerated amphetamine effect following 
ventromedial hypothalamic lesions in rats 
may represent a denervation supersensitivity 
of remaining alpha-adrenergic sites. Thus, 
amphetamine may depress feeding by mimick- 
ing the activity of the ventromedial hypo- 
thalamic “satiety center" in the intact brain, 
but the assumption underlying the reversal of 
this action with ventromedial hypothalamic 
lesions (ie. attenuation or elimination of 
the drug’s effect) does not necessarily hold 
because of a changed sensitivity of remaining 
sites, Provided one accepts the view that am- 
phetamine does not mediate its effect on feed- 
ing solely by mimicking the activity of the 


5Stowe and Miller 
a similar exaggerated е 
ing ventromedial hypothala: 


crepancy in these findings per ү 
llowing ventromedial hypothalamic 


of amphetamine fo o 
lesions has been pointed out by Mever and Meyer 


(1963). 


(1957) previously had found 
ffect of amphetamine follow- 
mic lesions. The dis- 
taining to the effects 
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ventromedial hypothalamic “satiety center,” 
the concept of denervation hypersensitivity 
has some merit in explaining the exaggerated 
action of the drug following ventromedial 
hypothalamic lesions. One might also suggest 
that other findings of an exaggerated effect 
of amphetamine on feeding following anterior 
hypothalamic lesions (Cole, 1966) are due to 
a similar denervation hypersensitivity. The 
concept of denervation hypersensitivity does 
not explain the findings of Sharp et al. (1962) 
that ventromedial hypothalamic lesions at- 
tenuated the effects of amphetamine on feed- 
ing. Since cats were used in the study by 
Sharp et al., in which support was found 
for the ventromedial-hypothalamus-mimicking 
hypothesis, and rats were used in the studies 
that did not support the hypothesis (Epstein, 
1959; Reynolds, 1959), a species-specific ex- 
planation of the discrepant findings cannot 
be ruled out. However, a species-specific ex- 
planation does not hold when considering the 
fact that in the intact brain the drug has been 
observed to increase the firing rate of neural 
units in the ventromedial hypothalamus of 
cats (Brobeck et al., 1956) and of rats (Krebs 
et al., 1969) in much the same manner. As 
an alternative explanation of such discrepant 
findings, Carlisle (1964) has suggested that 
the study by Sharp et al. may not be a valid 
test of the changing effectiveness of ampheta- 
mine following ventromedial hypothalamic 
lesions, since food consumption was measured 
over a 24-hour period rather than during a 


restricted period representing the peak action 
of the drug. 


Lateral H ypothalamus-Blocking Hypothesis 


Booth (1968) has demonstrated that can- 
nula-guided chemical stimulation of the lateral 
hypothalamus resulted in a consistency of 
correlation between the effectiveness of nor- 
adrenalin to elicit cating and the effectiveness 
of amphetamine to depress eating. These re- 
sults suggested that amphetamine mediates 
Its anorexic action on feeding by the blocking 
of adrenergic activity in the lateral Bano: 
thalamic "feeding center." Basic to eo 
blocking may be a direct stimulation of beta- 
adrenergic cells in the lateral hypothalamus 
(Leibowitz, 1970) or an increase in the 
threshold of excitability in the “feeding cen 
ter," since the drug has been found to ч 


crease the threshold for eating by stimulation 
of the lateral hypothalamus (Miller, 1960; 
Stark & Totty, 1967). Additional evidence 
that the anorexic effects of amphetamine are 
counteracted by electrical stimulation of the 
lateral hypothalamus (Thode & Carlisle, 1968) 
and that amphetamine decreases the am- 
plitude of elicited lateral hypothalamic ac- 
tivity (Reiter, 1970) also are consistent with 
the view that the drug depresses feeding by 
decreasing lateral hypothalamic excitability. 

As was true with the ventromedial-hypo- 
thalamus-mimicking hypothesis, a basic as- 
sumption underlying the lateral-hypothala- 
mus-blocking hypothesis is that if the lateral 
hypothalamic region is destroyed by lesion- 
ing, the depressant effects of amphetamine on 
feeding should be attenuated or eliminated. 
The findings by Carlisle (1964) that the de- 
pressant effect of amphetamine was attenu- 
ated in lateral-hypothalamus-lesioned sub- 
jects who had recovered from postoperative 
aphagia-adipsia support such an assumption. 
Since the lateral-hypothalamus-lesioned sub- 
jects and the control subjects did not differ 
significantly with respect to the action of the 
drug on water intake, the changing effective- 
ness of amphetamine following such lesions 
cannot be attributed to indirect influences 0? 
feeding behavior. 

Tn addition, the lateral-hypothalamus-block- 
ing hypothesis would seem to provide a mean- 
ingful organization of the exaggerated effects 
of amphetamine on feeding following ventro- 
medial hypothalamic lesions (Epstein, 1959: 
Reynolds, 1959) in terms of changing thresh- 
old properties of the lateral hypothalami® 
"feeding center." There is evidence he 
electrolytic ventromedial hypothalamic lesion 
produce the typically observed hyperphag" 
syndrome by the releasing of inhibitory !" 
fluences through fibers running laterally " 
the lateral hypothalamus from the vent 
medial hypothalamus (Anand & Brotier 
1951). Thus, the exaggerated effect of E- 
phetamine may be due to changes in the 4iC 
active sensitivity of the lateral hypothala? ү 
"feeding center" produced by the releas® h, 
ventromedial hypothalamic restraint, M" 
in turn, increases the effectiveness of ШЕ d d 
in raising the lateral hypothalamic th" gs 
of excitability. Although a similar but he 
direct “release of restraint” may underly 
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p action of the drug following an- 
Eu өш lesions (Cole, 1966), 
a b. d 1C nature of the release still remains 
E e etermined. Since anterior-hypothala- 
ко subjects do not characteristically 
т А мр they are heat stressed (Hamil- 
fou. x ), the exaggerated action of the 
oe a be attributed to a simple “re- 
might چ‎ on feeding motivation аз 
by чарча be done with changes produced 
esr Pg hypothalamic lesions. How- 
located ers of the medial forebrain bundle are 
A in the anterior hypothalamus and 
tivity pe functions in regulating the ac- 
Ру the lateral hypothalamic “feeding 
Eu (Morgane, 1961a, 1961b); thus, 
with p restraint" on such connections 
ог oW feeding. center" resulting from an- 
direci ypothalamic lesions might produce in- 
on So the same kind of exaggerated effect 
E ps action as is produced more 
The | y by ventromedial hypothalamic lesions. 
ама Lacan aaa а hypothesis 
tion Lary to provide a meaningful organiza- 
Mies the exaggerated effect of amphetamine 
fos ыа; ventromedial hypothalamic lesions 
of es е reason as well, The proposed site 
Fisch etamine's action (lateral hypothalamic 
бурт center") is intact and could become 
medi е ог less restrained, thereby 
dru ating an increased effectiveness of the 
x In contrast, an organization of such 
min ence by the ventromedial-hypothalamus- 
уе nicking hypothesis is less clear, since the 

Ty structure hypothesized to mediate the 
Tug’s action (the ventromedial hypothala- 


mus) ; 
Us) is no longer intact. 


7 
uw Comments on the Mediation of Am- 
mine Anorexia 
A ogl it would be convenient and cer- 
с tie to find that amphetamine 
ее es its depressant effect on feeding by 
ane the ventromedial hypothal- 
i satiety center" or by blocking activity 
the lateral hypothalamic “feeding center, 
mano suggests that both regions ate 
anor ved in the mediation of the drug’s 
a properties, While the evidence 9 
and/o from the intact brain (via recording 
Previo: Stimulation) supports both of the 
ess us hypotheses, the lesioning evidence 
clearly supports either. The findings of 


nan et al. (1962) support the ventromedial- 
ypothalamus-mimicking hypothesis, and the 
Шр. of Carlisle (1964) support the lateral- 
ypothalamus-blocking hypothesis; however, 
the exaggerated action of the drug following 
ventromedial hypothalamic lesions (Epstein 
1959; Reynolds, 1959) might possibly be 
plained by either hypothesis, provided one 
is willing to entertain some additional as- 
sumptions. Furthermore, as pointed out by 
Kreb et al. (1969), even the evidence that 
favors the ventromedial-hypothalamus-mim- 
icking hypothesis does not necessarily rule 
out the possibility that the drug produces its 
depressant effect on feeding by acting on the 
lateral hypothalamic “feeding center.” 
Although studies (Glick, 1969; Glick & 
Jarvik, 1970; Jonason, Lauber , Robbins, 
Meyer, & Meyer, 1970; Novick & Pihl, 1969) 
have investigated the combined effects of am- 
phetamine and Jesions in sites other than the 
hypothalamus on learning and performance, 
little evidence is available on the role such 
sites play in mediating the anorexic action of 
the drug. What evidence there is suggests that 
the role of cortical and other subcortical sites 
in regulating the drug’s effect on feeding 
parallels the role of such sites in the regula- 
tion of general feeding, Harris et al. (1947) 
observed that human subjects with frontal 
lobotomies were less susceptible to the anorexic 
properties of amphetamine than were normal 
subjects, and Andersson and Larsson (1957) 
observed similar changes in the depressant ef- 
fect of the drug on feeding in dogs following 
prefrontal lobotomies. Recently, Carey and 
Salim (1970) found that septal lesions slightly 
attenuated the anorexic effect of ampheta- 
mine, but the attenuation did not reach sta- 
tistical significance. Since Carlisle and Reyn- 
olds (1961) observed an exaggerated effect 
of amphetamine on feeding following brain- 
stem (area postrema ) lesions, sites down- 
eam to the hypothalamus may also serve 
ancillary functions in the total 
e drug’s action. Ruch, Patton, 
(1942) have identified fibers 
that descend from the lateral hypothalamic 
“feeding center” and exert their influence 
otor neurons and nerves 


upon internuncial moto a 
involved in the execution of feeding: Thus, 
it is possible that the exaggerated effect of 


the drug in this latter case is due to the “те. 


stri 
important 
mediation of th 
and Brobeck 
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lease of restraint" on executive functions of 
feeding. 


CONCLUDING COMMENTS 


Amphetamine is similar in structure to the 
catecholamines and mimics many of their cen- 
tral effects (Carlton, 1963). Furthermore, the 
importance of such adrenergic systems to the 
effects of the drug on nonfeeding performance 
is well established (Anden, Carlsson, & Hag- 
gendal, 1969; Maickel, Cox, Miller, Segal, & 
Russell, 1969; Weiss & Laties, 1969). 

Although in some instances it may be pos- 
sible to separate the brain mechanisms under- 
lying the effects of amphetamine on feeding 
from those underlying the effects of the drug 
on nonfeeding tasks, this would not always 
seem to be possible. Separate mechanisms 
have been suggested in the case of general 
motor behavior and feeding (Cole, 1966; Van 
Rossum & Simons, 1969); however, in the 
case of operant performance, the brain sites 
regulating self-stimulation and feeding may 
overlap considerably (Gluckman & Baum, 
1969; Hoebel & Teitelbaum, 1962; Khavari, 
1969; Olds, 1962; Pickens & Harris, 1968; 
Poschel & Ninteman, 1966; Stein, 1964). 

Future research should attempt to clarify 
further the role of hypothalamic mechanisms 
in mediating the anorexic properties of am- 
phetamine and to identify more specifically 
the adrenergic systems underlying the drug's 
action on nonfeeding performance. In those 
instances where there is an overlap of such 
mediational processes, attempts should be 
made to identify the structural and/or func- 
tional nature of the overlap. 


REFERENCES 


L. H. Hyperphagia in 
ventromedial and lateral 
hypothalamus, Science, 1969, 165, 599-600. 

d. A. Hypothalamic con- 
and cats. Yale Journal 
51, 24, 123-138. 

А, & HAGGENDAL, Js 

nual Review of Phar- 


dogs. 
1957, 38, 22-30. 


AREES, E, A, & 


Maver i А 
between medial ‚ J. Anatomical connections 


and latera] regions of the hypo- 


thalamus concerned with food intake. Science, 
1967, 157, 1574-1575. 

Baracura, S, & Devenvort, L. D. Feeding patterns 
of normal and ventromedial hypothalamic lesioned 
male and female rats. Journal of Comparative and 
Physiological Psychology, 1970, 71, 357-364. 

Воотн, D. A. Amphetamine anorexia by direct ac- 
tion on the adrenergic feeding system of rat hypo- 
thalamus. Nature, 1968, 217, 869-870. 

BROBECK, J. К. Mechanism of the development of 
obesity in animals with hypothalamic lesions. 
Physiological Review, 1946, 26, 541—559. 

BROBECK, J. R., Larsson, S, & Reyes, Е. A Study 
of electrical activity of the hypothalamic feeding 
mechanism, Journal of Physiology, 1956, 132, 
358-364. 

Вковеск, J. R., TEPPERMAN, J., & Loxc, C. №. Н. 
Experimental hypothalamic hyperphagia in the 
albino rat. Yale Journal of Biology and Medi- 
cine, 1943, 15, 831-852. 

Bnooxs, C. McC., Lockwoop, R. А., & Wicorss, M. 
L. A Study of the effects of hypothalamic lesions 
on the cating habits of the albino rat. American 
Journal of Physiology, 1946, 147, 735-741. 

Carey, К. J., & SALIM, A. P. Changes in d-ampheta- 
mine reactivity resulting from septal forebrain in- 
jury. Physiology and Behavior, 1970, 5, 133-136. 

Caruste, H. J. Differential effects of amphetamine 
on food and water intake in rats with lateral 
hypothalamic lesions. Journal of Comparative a” 
Physiological Psychology, 1964, 58, 47-54. 

CARLISLE, H. J., & Rrvworps, К. W. Effect of am- 
phetamine on food intake in rats with brain-ste™ 
lesions. American Journal of Physiology, 1961, 201, 
965-967. 

CARLTON, Р. L. Cholinergic mechanisms in the COP 
trol of behavior by the brain. Psychological Re- 
view, 1963, 70, 19-39. p 

Carr, L. A, & Moore, K. E. Norepinephrine: хе 
lease from brain by d-amphetamine in vivo. Set 
ence, 1969, 164, 322-323. by 

Core, S. О. Increased suppression of food intake с 
amphetamine in rats with anterior hypothala 
lesions. Journal of Comparative and Physiolog!t 
Psychology, 1966, 61, 302-305. € 

Corr, S. O. Experimental effects of amphetamine 
review. Psychological Bulletin, 1967, 68, 81-99. 

Corr, S. О. Brain mechanisms and depressant ай 
of amphetamine on feeding behavior. Psycholog" 
Reports, 1968, 23, 775-782. ine: 

Corr, S. О. Experimental effects of ampheta jjs, 
Supplementary report. Perceptual and Motor > 
1970, 31, 223-232. dm 

Cox, V, & Kaxorrwskr J. W. Sex differenc® ol 
body weight regulation in rats following * 
hvpothalamic lesions. Communications i" 
ioral Biology, 1970, 5, 195-197. x É 

Cox, V. C., KAKOLEWSKI, J. W., & VALENSTES nges 
S. Ventromedial hypothalamic lesions and le and 
in body weight and food consumption in yyi 
female rats. Journal of Comparative and 
logical Psychology, 1969, 67, 320-326. 

Dercano, J. W., R., & Anann, B. К. Increas 
intake induced by electrical stimulation 


e of ° 


AMPHETAMINE ANOREXIA 19 


laters j У б 
bro vc --—-— 
pea $5 Wise, К. A, MILGRAM, N. W, & 
tior T G. Physiological control of hypo- 
of С ically elicited feeding and drinking. Journal 
і 1970, xin and Physiological Psychology, 
ыс L. V. Role of postoperative feeding ex- 
dns ce m recovery from lateral hypothalamic 
Psychol ournal of Comparative and Physiological 
EPSTEIN, ogy, 1970, 72, 60-65. 
ае A. Suppression of eating and drinking by 
hype eis and other drugs in normal and 
Phosiol agic rats. Journal of Comparative and 
Ектрм. ® ogical Psychology, 1959, 52, 37-45. 
eee 5. E. LARSSON, S, Druck, M. К, & 
Jong rend S. Apharia in chickens. American 
soul of Pyslelogy, 1851, 123, 189-07. 
We sis M. H, Fuxpenpurx, W. H, & Warp, T. 
Msg ue on the site of action of a new anorectic 
Experi enfluramine. Journal of Pharmacology and 
Fors rimental Therapeutics, 1969, 165, 60-70. 
Th ER, J. L., Rosvorp, H. E, & Prreram, K. Н. 
cea on affective and cognitive behavior in 
cum og of lesions of the pyriform-amygdala-hippo- 
io npal cortex. Journal of Comparative and Physio- 
s Bical Psychology, 1957, 50, 89-96. 
ав D. Effects of d-amphetamine and frontal 
"d ations on response suppression in rats. Journal 
196 Comparative and Physiological Psychology, 
Сил 9, 69, 49-54. 
ырк, 5. D, & Jarvis, М. E. Differential effects of 
е апа scopolamine on matching per- 
уун ы ы of monkey with lateral frontal lesions. 
ana of Comparative. and Physiological Psy- 
Niue By, 1970, 73, 307-313. 
of KMAN, M. I, & Baum, 
Iprindole, a new anti-depressant. 
, Pharmacologia, 1969, 15, 169-185. 
Ts R. M. Hypothalamic hyperphagia produced 
pe parasagittal knife cuts. Physiology and Be- 
Pig 1970, 5, 23-25. (a) A 
LD, R, М. Hypothalamic hyperphagia: Males get 
= as fat as females. Journal of Comparative 
к Physiological Psychology, 1970, 71, 347- 
56. (b) А 
peta & STELLAR, E. Hyperphagia, 
logic DL Journal of Comparative an 
биа al Psychology, 1962, 55, 418-424. 
зма S. P. Eating or drinking elicited 
EO еше or cholinergic stimulation 0 
Ross alamus, Science, 1960, 132, 301-302. 
sti MAN, S. P. Behavioral effects of chemical 
imulation of the ventral amygdala. Journal of 
s; ative and Physiological Psychology, 1964, 
^ » 29-36. 
кыш C. L. Interaction. of 
P ds regulation in the rat. Journal 
76-48%, and Physiological Psychology, 
2 5. C, Туу, А. Cy & SEARLE, L. M. The 
enu o of amphetamine-induced loss of weight. 
1947 al of the American Medical Associations 
» 134, 1468-1475. 


T. The pharmacology 
Psycho- 


obesity, and 
d Physio- 


G 


] by di- 
f the 


food intake and 
1 of Com- 
1963, 56, 


€ s W. The production of hypo- 

mee y in rats already displaying chroni 

hypopituitarism. American Tournal of Ph ү. 
1943, 140, 89-02. 97 наори 

HETHERINGTON, n W Non-production of hypo- 
thalamic obesity in the rat by lesions rostral to 
the ventro-medial hypothalamic nuclei. Journal of 
Comparative Neurology, 1944, 80, 33-45. 

HOEBEL, B. G. Hypothalamic lesions by electro- 
cauterization: Disinhibition of feeding and self- 
stimulation. Science, 1965, 149, 452-453. 

Horse, B. G. Feeding: Neural control of intake. 
Annual Review of Physiology, 1971, 33, 533-568. 

HOEBEL, B. G., & TEITELBAUM, P. Hypothalamic con- 
trol of feeding and selí-stimulation. Science, 1962 
135, 375-376. : 

Joxasow, К. R., Lauper, S. M. ROBBINS, M. J., 
Meyer, P. M, & Meyer, D. R. Effects of am- 
phetamine upon relearning pattern and black- 
white discriminations following neocortical lesions 
in rats. Journal of Comparative and Physiological 
Psychology, 1970, 73, 47-55. 

Kuavarr, A. Effects of central versus intraperitoneal 
d-amphetamine administration on learned behavior. 
Journal of Comparative and Physiological Psy- 
chology, 1969, 68, 226-234. , 

Kirscupaum, W. К. Excessive hunger as a symptom 
of cerebral origin. Journal of Nervous and Mental 
Disease, 1951, 113, 94-114. 

Kreps, H., BINDRA, D, & CAMPBELL, J. F. Effects of 
amphetamine on neural activity in the hypothala- 
mus. Physiology and Behavior, 1969, 4, 685-691. 

LEIBOWITZ, 5. F. Hypothalamic B-adrenergic “satiety” 
system antagonizes an a-adrenergic “hunger” sys- 
tem in the rat. Nature, 1970, 226, 963-964. 

LIEBELT, R. A., & Perry, JE Hypothalamic lesions 

with goldthioglucose-induced obesity. 

for Experimental Biology 


assosciated 
Proceedings of Society 
and Medicine, 1957, 95, 774-777. 
Marcxet, R. P, Cox, R. Н., Jn, MILLER, E. p 
SEGAL, D., & RUSSELL, R. W. Correlation of brain 
levels of drugs with behavioral effects. Journal of 
Pharmacology and Experimental Therapeutics, 
1969, 165, 216-224. j ? 
Marcutes, D. L. Noradrenergic synapses 1n peri- 
fornical hypothalamus for the suppression of feed- 
ing behavior by satiety. Federation Proceedings, 
1969, 28, 641. (Abstract) 
Marcutes, D- L. Alpha-adrenergic receptors in hypo- 
thalamus for the suppression of feeding behavior 
by satiety. Journal of Comparative and Physiologi- 
cal Psychology, 1970, 73, 1-12. 
MARSHALL, N. B, BARRETT, R. J» & MAYER, J 
Hypothalamic lesions in goldthioglucose injected 
mice. Proceedings of Society for Experimental 
Biology and Medicine, 1955, 90, 240-244. И 
Meyer, D. К. & MEYER p. M. Brain functions. 
Annual Review of wes ce a di 
Ч Motivational effects of brain stimula- 
маш& a Ln Federation Proceedings, 1960, 19, 


ше Effects of amphetamine on selí 
а . Effects a m 
Mou T induced drinking. Physiology and 


stimulation 2 lu 
Behavior, 1968, 3, 133-136. 


20 SHERWOOD О. COLE 


Moreang, P. J. Distinct feeding and hunger motivat- 
ing systems in the lateral hypothalamus of the 
rat. Science, 1961, 133, 887-888. (a) 

Morcang, P. J. Electrophysiological studies of feed- 
ing and satiety centers in the rat. American Jour- 
nal of Physiology, 1961, 201, 838-844. (b) 

Morrison, S. D. The relationship of energy ex- 
penditure and spontaneous activity to the aphagia 
of rats with lesions in the lateral hypothalamus. 
Journal of Physiology, 1968, 197, 325-343. 

Murray, H. M, & Brown, T. S. Effects of hippo- 
campal lesions on simple and preferential consum- 
matory behavior in the rat. Journal of Compara- 
tive and Physiological Psychology, 1970, 72, 404— 
415. 

NATHANSON, M. Н. The central action of benzedrine. 
Journal of the American Medical Association, 
1937, 108, 528-531. 

NOVICK, L, & Prat, R. Effect of amphetamine on 
the septal syndrome in rats. Journal of Compara- 
tive and Physiological Psychology, 1969, 68, 220- 
225. 

Orns, J. Hypothalamic substrates of reward. Physio- 
logical Review, 1962, 42, 554-604. 

Pickens, В., & Harris, W. C. Self-administration of 
d-amphetamine by rats. Psychopharmacologia, 
1968, 12, 158-163. 

Poscuet, B. P. Н, & NINTEMAN, F. W. Hypothal- 
amic self-stimulation: Its suppression by blockade 
of norepinephrine synthesis and reinstatement by 
methamphetamine. Life Sciences, 1966, 5, 11-16. 

Ромку, T. L, & Kersey, В. E. Relationship of 
body weight to the lateral hypothalamic feeding 
syndrome. Journal of Comparative and Physio- 
logical Psychology, 1970, 70, 23-36. 

Pripram, K. Н., & BacsHaw, M. Further analysis 
of the temporal lobe syndrome utilizing fronto- 
temporal ablations. Journal of Comparative Neu- 
rology, 1953, 99, 347-375. 

REITER, L. Effects of amphetamine on lateral hypo- 
thalamic activity in response to amygdaloid stimu- 
lation. Federation Proceedings, 1970, 29, 383. (Ab- 
stract) 

Reynotps, R. W. The effect of amphetamine on 
food intake in normal and hypothalamic hyper- 
phagic rats. Journal of Comparative and Physio- 
logical Psychology, 1959, 52, 682-684. 

Reyyotps, R. W. Ventromedial hypothalamic lesions 
without hyperphagia. American Journal of Physiol- 
ogy, 1963, 204, 60-62. 

Кгүхотрѕ, К. W, An irritative hypothesis concern- 
ing the hypothalamic regulation of food intake. 
Psychological Review, 1965, 72, 105-116. 

Ropcers, W. L., Epstein, A. N, & TEITELBAUM, Р. 
Lateral hypothalamic aphagia: Motor failure or 
motivational deficit ? American Journal of Physiol- 
05У, 1965, 208, 334-342. ? 

Ruc, T. C., Patton, Н. D,& BROBECK, J. R. 
Hyperphagia and adiposity in relation to distur- 
bances of taste. Federation Proceedings, 1942, | 
76. (Abstract) Eres 

Sctaranr, A. BELLUzzr, J. D, & Grossman, S. P. 
Effects of lesions in the hypothalamus and amyg- 


dala on feeding behavior in the rat. Journal of 
Comparative апа Physiological Psychology, 1970, 
72, 394—403. 

SHARP, J. C., NEILSON, Н. C., & Porter, Р. B. The 
effect of amphetamine upon cats with lesions in 
the ventromedial hvpothalamus. Journal of Con- 
parative and Physiological Psychology, 1962, 55, 
198-200. : 

STARK, P., & Torry, С. W. Effects of amphetamine 
on eating elicited by hypothalamic stimulation. 
Journal of Pharmacology and Experimental Thera- 
peutics, 1967, 158, 272-278. 

STEIN, L. Amphetamine and neural reward mecha- 
nisms. In H. Steinberg (Ed.). Animals behavior 
and drug action, London: J. & A. Churchill, 1964. 

Stowe, F. R, & Miter, A. T. The effect of am- 
phetamine on food intake in rats with hypo- 
thalamic hyperphagia. Experientia, 1957, 13, 114- 
115. 

Stromrincer, J. L., BROBECK, J. R., & Conr, R. L. 
Regulation of food intake in normal rats and in 
rats with hypothalamic hyperphagia. Vale Journal 
of Biology and Medicine, 1953, 26, 55-74. 

TEITELBAUM, P., & STELLAR, E. Recovery from the 
failure to eat produced by hypothalamic lesions. 
Science, 1954, 120, 894-895. 

TEITELBAUM, P., & Ерѕтеіх, А. The lateral hypo- 
thalamic syndrome: Recovery of feeding and drink- 
ing after lateral hypothalamic lesions. Psychologi- 
cal Review, 1962, 69, 74-90. 

Trove, W. F., & Carriste, H. J. Effect of lateral 
hypothalamic stimulation on amphetamine-induced 
anorexia, Journal of Comparative and Physiologt- 
cal Psychology, 1968, 66, 547-548. 

Van Rossum, J. M., & Simons, F, Locomotor ac 
tivity and anorexogenic action. Psychopharma- 
cologia, 1969, 14, 248-254, | 

Wane, G. N., & ZUCKER, I. Modulation of food in- 
take and locomotor activity in female rats by 
diencephalic hormone implants. Journal of Col 
parative and Physiological Psychology, 1970, 72, 
328-336. | 

WAGNER, J. W, & DE Groot, J. Effect of goldthio- 
glucose injections on survival organ damage ans 
obesity in the rat. Proceedings of Society for Ex- 
perimental Biology and Medicine, 1963, 112, 33-37. 

Weiss, B., & Lates, V. С. Behavioral pharmacolo£* 
and toxicology. Annual Review of Pharmacology 
1969, 9, 297-326. i 

Warme, N. M, & Fisurm, A. E. Relationship Me 
tween amygdala and hypothalamus in the a 
of eating behavior. Physiology and Behavior, 199^ 
4, 199-205. "m 

WILLIAMS, D. R., & Тытеваом, P. Some ed 
tions on the starvation resulting from lateral hyp 
thalamic lesions. Journal of Comparative @ 
Physiological Psychology, 1959, 52, 458-465. 

WYRWICKA, W., & Dommzrcka, C. Relationship x 
tween feeding and satiation centers of the hYP 
thalamus. Science, 1960, 132, 805-806. 


be- 


(Received June 25, 1971) 


Psychological Bulletin 
3, Vol. 79, No. 1, 21-44 


OLFACTORY CONTROL OF BEHAVIOR IN RODENTS 3 


р 


Е. FRED SCHULTZ anp JACK Т. ТАРР? uo Bp 
1 


Vanderbilt University 


Behavioral literature is examined for information that could clarify the manner 
in which rodent behavior is mediated by olíaction. Functional use of olfaction 
is readily apparent in some of the behaviors reviewed, for example, bait shyness. 
olfactorally modulated aggression, olfactory marking of and SEDED of 
stressful environments, and inhibition of food seeking by odors generated from 


frustration. Other behaviors are revie 
an important, though yet undelineated 
preference based on sexual dimensions. 
are reviewed which meet current 
any apparent function for the ani 
standing of the functional use of 
philosophy of research away 
odorants 
behaviors in natural social contexts. 


Баана of the sense of smell by laboratory 
avior Provides investigators of animal be- 
ign a subject somewhat akin to the 
seldon r—a topic generally discussed but 
1 1 acted upon. The olfactory capabilities 
08 какан animals, particularly the rat, 
нф. with a respect bordering on awe. 
the limit is obviously necessary to determine 
ish a » S of these capabilities and to estab- 
view ا‎ foundation for such a point of 
entatic ut more importantly, a conceptual ori- 
directio, 1s necessary to give this research 
iews ie This is particularly true when one 
S rat е origins of research on olfaction in 
ion ha, Laboratory studies of rodent olfac- 
Bram ave grown out of rodent control pro- 
5, extensions of experimental techniques 


ro : 
m other sense systems to olfaction, and 


at 2 
tempts to explain aberant experimental re- 
of phero- 


sults ; 
Dm learning studies. The study 
resean oo also had its effects on rodent 
ave 8 (Gleason & Reyneirse, 1969), as 
these ecological studies of the rat. However, 
Prody varied motivations for research have 
аен a spotty coverage of the range of 
пс les in which odors are suspected to 
lon, 


liş purpose of this review is to attempt 
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wed in which olfaction is thought to play 


role, for example, differences in approach 
Still other olfactory-associated behaviors 


ind behavioral measures tow 


standards of inferential significance yet lack 
mal. It is concluded that a coherent under- 
[ olfaction by rodents awaits a shift in the 
from rigorous examination of loosely selected 


ard rigorous examination of normal 


to impose some order on the myriad of studies 
on the olfactory control of rodent choice 
behaviors that have appeared in the literature 
within the past decade. The emphasis of this 
review is on the functional properties that 
odors have in the regulation of rodent be- 
haviors. We follow the organization used by 
Barnett (1963) in his comprehensive study 
of the behavior of the rat because of the 
functional emphasis of this work, Within this 
organization there are repeated references to 
approach and avoidance behaviors observed 
by the investigators, but these behaviors gain 
their significance in the context of the total, 
functionally defined behavior. The review has 
been limited to the molar behavior of the 
macrosmotic order Rodentia, and studies of 
discrimination, sensitivity, or other "trained" 
responses to odors are generally excluded. 


E LiviNG SPACE 


For the majority of rodents, odors repre- 
sent the leading edge of the environment. 
When the visual world is bordered by the 
nearest pebble, clump of grass, or baseboard, 
a nose that is within a couple of inches of 
the ground offers enormous potential for 
directing behavior. Deferring discussion of 
movement associated with feeding, social, 
and sexual behavior, there remains the body 
of research on movement which relates to 


the role of odor 


MOVEMENT IN TH 


[57 
N 


Response to Novel Odors 


Like other stimuli, odors can be defined 
as novel when the test animal has had no 
previous known experiences with the odor. 
Experiments examining the responses of ro- 
dents to novel odors have employed a fairly 
wide range of odors off the shelf though the 
test for responding has typically been simple 
approach or withdrawal. The rationale for 
these studies has differed from different 
laboratories. 

Reiff (1956) made the initial assumption 
that the odors of normal food and odors from 
natural habitats were made attractive by dis- 
crete chemical components in the odor com- 
plex. He evaluated the preferences exhibited 
by wild rodents for a number of chemically 
pure olfactory stimuli. His sample included 
black rats (Rattus rattus), brown rats (Rattus 
norvegicus), house mice (Mus musculus), 
field mice (Microtus arvalis), and voles 
(Arvicola terrestris), as well as white mice 
and rats. The test odorants were (а) oils, 
essences, and extracts from animal and vege- 
table tissue and (5) organic acids and amines. 
These were placed in open jars in an open 
field. During the first phase of experimenta- 
tion 10 pairs of white mice or white rats 
were placed in the field, and a record was 
made of the number of approaches to each 
substance. Each substance was tested several 
times in different combinations and Reiff 
tested “the interesting preparations about 
30 times [p. 292].” 

The substances receiving further investiga- 
tion were presented to some combination of 
the captured wild rodents. From the report 
of the results it was difficult to discriminate 
between data obtained from the wild rats and 
mice and that obtained from the albinos. 
Each presentation of several odors produced 
a total number of approaches to the jars, and 
a percentage of approaches was calculated 
тн Ап average was then made 

age of approach scores for each 
substance and these average scores 
scaled so that a value of 100 repres 
indifference with respect to the comp: 
substances, No blank control was run to pro- 
vide an absolute reference point and no vari- 
ance statistics were reported, limiting the 
inferential properties of the measure. Retest- 
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ing the "interesting" odors systematically 
changed the nature of the base line of com- 
parison and resulted in the potential loss of 
the descriptive statistics ordinal properties. 
Finally, subjects were used for repeated ob- 
servations and no mention was made of sub- 
ject scheduling. Reiff's results must therefore 
be viewed with reservation, and considering 
the large number of substances he investi- 
gated, this is unfortunate. 

Of the odorants which closely resembled 
natural food and body odors, the following 
substances scored higher than 100: ambergris; 
natural and synthetic musk; furfuraldehyde 
(bran oil); vanillin; oil of peppermint; and 
oils from cod liver, rape seed, cotton seed, 
soy beans, thyme, and anise, The following 
substances scored lower than 100: civet; 
valerian; sperm oil, and oils from linseed, 
corn, peanuts, fennel, parsley, and celery. 

Albino rats and mice were used to test 
over 200 organic acids and amines, which 
were assumed to be a fractional component 
of naturally attractive mixtures. After cont 
paring results from the two groups of odor- 
ants, it was Reiff's opinion that no isolated 
compound was as effective an attractant 45 
the naturally occurring mixtures. Contrary t0 
Reiff's initial hypothesis, it appeared that the 
odors from normal food and from natural 
habitats are not made attractive by the 
discrete chemical components. He did observe 
high preference values for dihydroxyethylan" 
line, dihydroxyethylbenzylamine, and diiso" 
butylbenzylamine and concluded that tertia? 
amines were effective attractants. It is Ul 
fortunate that Reiff's results must be view® 
with reservation because of the problem? 
implicit in his methodology and in the геро! 
of his experiment. The work is of significa? 
value and more investigations should be СО! 
ducted to follow up the results. J 

Long and Tapp have examined rat ae 
sponses to novel odors in a series of stu e 
conceived to evaluate the reinforcing prope 
ties of odors. The measure they employ 
provides some indication of the function, , 
olfactory cues in exploration. Male a p» 
rats were given free-operant tests in a doub ). 
lever Skinner box (Long & Tapp, 
Pressing either lever delivered а light $ 
of air in the vicinity of the nose of the 
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E i . was odorized while the other 
bus, i „Ста air source flushed the 
of stray odors. Preference was in- 
lerred from differences in rates of lever 
pressing. 
In one experiment (Long & Tapp, 1967), 
ше of the odors was obtained by passing air 
| Over amyl acetate. The lever that produced 


жу 


om of odorized air was pressed signifi- 
this VM times than the other lever, but 
lis ва, decreased over three 30-minute 
in Yun ea This effect was not different 
fe rom other studies of light-onset re- 
ment conducted in the same laboratory 
on Tapp, 1969, for review). When а novel 
the уаз compared with light onset and with 
p all MS of other stimuli, the odor was over- 
different reinforcing, but not qualitatively 
196 nt from light onset (Tapp & Long, 
8). 
ae double-lever box was used to evaluate 
pie toa series of novel odorants (Tapp, 
über & Spiegler, 1967). The particular 
га апіѕ were chosen to provide а stimulus 
m each of the seven categories which 
m (1952) suggested as representing the 
Magd odors. Two methods of testing were 
мн it determine if both methods would re- 
i Dit same heirarchy of odor preferences. 
With n irst study, a single odor was compared 
n th © odor in different groups of animals. 
with : second study, odors were [gown 
three mue and er : лу 
ence the substances testec ee ie bath 
meth зш. in the same direction y 0 
» Ods: d-camphor was consistently pre- 
Ted, while musk ambrette and formalde- 


Yde were consistently not preferred. In all 
able preferences 


ar thee experiments, no dur 
omit odors were observed. If these odors 
dee tes properties, their ability to 
In ever pressing is at best transitory. 
gia, ir studies conducted by Long and 
Were o more durable reinforcing properties 
Observed for the odor of food. Though 
Somewhat different picture emerges for 
factors an the studies shed some light, on 
explo that are important in the control o 
» ini by olfactory stimuli. 
1967) Po these experiments (Long & Ta 
Over dr ne odor was obtained by passing 
Y powdered rat chow. There was a 


PP» 
air 


higher rate of responding on the lever that 
produced the food odor than on the control 
diis dood qui “There ws p ct 

! . There was no differ- 
ential responding for food odor when the rats 
were satiated. 

Long and Stein (1969) demonstrated that 
the lever-pressing task evaluated not only 
the attractive component of the odor, but 
that the response measure was sensitive to 
the change in stimulus level produced by the 
response. Three groups of male rats were 
tested in the double-lever box, and for all 
groups, the air in one of the carrier airstreams 
was odorized. For the first group, pressing 
a lever resulted in the onset of the adjacent 
airstream, For the second and third groups, 
both airstreams were on constantly and lever- 
pressing by the second group turned off the 
adjacent airstream for 1 second, while lever- 
pressing by the third group had no effect on 
the airstreams. All three groups exhibited 
more responding on the lever adjacent to the 


airstream than on the one near the 


odorized 
nished 


unodorized airstream. This rate dimi 
over five experimental sessions. 

When a response produced odor onset, the 
animals showed the greatest preference for 
the odorized lever. Turning the air off yielded 
somewhat less preference for the odor lever. 
The noncontingent arrangement resulted in 
the least magnitude of preference for the 
odorized lever, although the difference be- 
tween the odorized and the unodorized lever 


was still significant. 
The conclusions of t 
direct bearing on the ro i 
regulating exploration. The presentation of 
the odorized air to the noncontingent group 
attracted the animals to the vicinity of the 
lever, and they exhibited a higher response 
rate on the lever nearest the odor. The proba- 
bility of responding is enhanced if the stimu- 
]us changes following a lever press, though E 
onset of the odor of food is a more powerfu 
reinforcer for lever-pressing than Medea 
offset. The stimulus in these latter a i 
is a highly conditioned reinforcer а is 
animal has had an extensive sid wi : а 
odor. For this reason the durability of the 
reinforcing properties of the food А was 
uch greater than that observed for any 


his experiment have 
le of olfaction in 
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novel odor, and deprivation seems to be a 
necessary condition for the food odor to have 
these behavior-directing properties. 

The conclusions from this series of experi- 
ments are rather clear: (a) Like other stimuli, 
some novel odors can be important stimuli 
for directing the course of exploratory be- 
haviors, but they lose their behavior-direct- 
ing properties as they lose their novelty. 
(b) The power of an odor to attract appears 
to be a function of the animal's past history 
with the odor. (c) The behavior-directing 
properties of a significant odor such as a 
food odor can be enhanced when the presence 
of the odor is contingent on a particular re- 
sponse and the animal is deprived. 


Response to Biological Odors 


Biological odors in this context are those 
generated by animals. Their existence is gen- 
erally inferred from observing the behavior 
of the test animal. The range of such odors 
to which a rodent can respond is potentially 
vast but only a limited number of odors of 
this sort have yet been examined in detail. 


Odors from Conspecific 


Efforts to understand the manner in which 
odors from conspecifics can influence move- 
ment about the living space are considerably 
frustrated by man's inability (or unwilling- 
ness) to smell objects and animals in the 
environment in the same fashion as do the 
animals themselves. There are four major 
lines of inquiry which suggest that odors 
from conspecifics have an influence on move- 
ment patterns: territorial marking, spontane- 
ous alternation, stress odors, and frustration 
odors. It is possible that the odors mediating 
these four phenomena may not be entirely 
different, though such a suggestion is cur- 
rently untested. The existence of other such 
phenomena is also a very real possibility. 

Territorial marking. Territorial marking, 
along with food selection and mate identifica- 
tion, is one of the most widely acknowledged 
olfactory functions that urban man Жест Des 
to mammals. Convincing research on terri- 
torial marking, particularly in rodents, is 
conspicuously lacking. A large number of in- 
vestigators have observed behavior resembling 
the marking of territory with odorants, and 
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others have observed behavior that seems con- 
sistent with the existence of territorial marks, 
but much remains to be tied together. (See 
Eisenberg & Klein, 1972 and Ralls, 1971.) 

Some behavior patterns suggestive of terri- 
torial marking have been studied in rodents. 
Reiff (1956) found that the faintly visible 
marks left by rats in their communal trails 
and in nests possessed an odor which elicited 
investigation if used as a test odorant. He 
found that rats’ urine had the same effect if 
it were allowed to ferment, but this attractant 
property was not observed if the urine were 
fresh or if the solids, including the bacteria, 
were removed by centrifugation before aging. 

A number of species of rodents possess 
externally ducted glands and indulge in rub- 
bing behavior that is frequently referred to 
as marking. Among rodents for which such 
behaviors have been studied are members of 
two families: (a) Heteromyidae, which is the 
family of pocket mice (Eisenberg, 1963) and 
(b) Cricetidae, which includes both gerbils 
(Baran & Glickman, 1970; Thiessen, Blume, 
& Lindzey, 1970) and new world rats and 
mice. Although the evidence suggests that 
other conspecifics respond differentially to ob- 
jects if they are marked or unmarked, the 
dimensions of the behavior observed do not 
yet suggest a territorial delineating function. 
Additional study is indicated. 

Spontaneous alternation. Odors from CON 
specifics have effects on the regulation of ех” 
ploration in a novel environment. Specifically: 
it is rather clear from a number of studies 
that the rat’s presence in a novel environment 
can provide cues that decrease the chances 
that the animal will return to that same plac? 
at the next opportunity. One of the major 
cues that produces this behavior pattern ар” 
pears to be odor left by the animal. б 

The identification of this function of 04° 
has arisen from research on spontaneous al- 
ternation, a behavior pattern in which a 
animal enters an alley different from one 
chosen previously during the unrewarded €* 
ploration of a T-maze or a cross-maze. —, 

Douglas (1966) examined the theoretic? 
effect of stimulus preference upon choice 4 
havior and derived a formula that parcele 
from the observed alternation rate the e 
of both chance and differential stimulus ? 
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tractiveness. This formula yielded a “true” 
alternation tendency. He empirically vali- 
E the assumed statistical independence 
Ph peated-alternation measures on а homo- 
geneous population of animals and derived 
à parametric base line for further investiga- 
E o alternation behavior. The experi- 
ie problem was to examine the role of 
naze, extramaze, and response-induced 
cues, 
b lue ende of his studies showed that ani- 
m — from chance alternation for 
ae e a only: (a) to avoid their own 
Жш. m and (0) to walk in a direction 180 
effect e direction chosen previously. These 
fiesh > were demonstrated by providing à 
tion vanae floor, which eliminated alterna- 
as an odor trail, and by passively turning 
a nimals for eight rapid horizontal arcs 
fua ж trials to disrupt the ability to alter- 
i m оп a vestibular-maintained spatial 
бг ats did not alternate to the extramaze 
or nag ,Cues provided by pepperminty- 
mee | liquid soap at the end of one arm 
h y aromatic pipe tobacco at the other. 
d did show a preference for the pepper- 
he odor indicating that it was noticed. 
iene of a spatial map provided a 
trail 5 percentage of alternation than did odor 
е ие. and the additive effect of the 
on os combined accounted for the alterna- 
rate when all cues were present. 
eu (1967) demonstrated that male rats 
allo aneously alternated when they were 
E Wed to use either an odor trail as а спе, 
"be spatial map based on the vestibular 
nated as a cue, or both. Alternation was elimi- 
Nas On test trials when both the paper floor 
3 replaced and the maze was rotated 
9 i n was confirmed in two experiments 
(196 & substantial support for Douglas's 
6) conclusions. 
bor qn (1968) has provided additional sup- 
ment OF Douglas’s conclusions in an oe 
relative which the attempted to assess is 
factory importance of vestibular = ol- 
test ^ Aie Only one exposure trial s as 
ulb ejes given to rats. Bilateral ol qe! 
Aerosol] ations and cleaning the maze ? : 
meth Windex reduced odor cues. Bol 
ods eliminated alternation to odor trail 


Cues, f 
+ Passive horizontal rotation of the sub- 


ject through 180° eliminated alternation to 
spatial cues when odor was not available. 

A rotatable cross-maze was used to put 
spatial cues in conflict with the odor trail 
left in the choice arms. When this was done, 
alternation to spatial cues occurred with the 
resulting behavior being approach to the odor 
trail. Latencies were significantly higher on 
test trials when either odor or spatial cues 
were absent or were in conflict. 

Still and MacMillan (1969) administered 
alternating trials involving four odor trail 
conditions to 24 male and 24 female hooded 
rats. The vestibular map was disrupted by 
the use of two T-mazes set at 90? to each 
other. Manipulation of the floor paper be- 
tween trials provided four odor trail condi- 
tions. In the first. condition, the paper was 
moved to the other maze intact from the first 
trial to the second. Under this condition, the 
subjects alternated away from the odor. For 
the second condition, fresh paper was used 
{ог the second trial, and no alternation was 
observed. These two conditions replicate the 
findings of Douglas (1966), Brill (1967), 
and Rosen (1968), with the additional result 
that females alternated to the same extent as 
males. 

The third condition was created by rotating 
the floor paper before the second trial so that 
the termination of the old odor trial underlay 
the start box and that portion of the trail 
that had been in the old start box was now in 
the opposite goal box. To turn away from 
the odor trail at the choice point would be 
to repeat the same direction of turn that 
occurred in the previous trial. Under this 
condition, the animals alternated left-right 
directionality (a response-induced cue) and 
thereby backtracked over the old odor. The 
fourth condition consisted of placing the old 
trail under the stem so that when placed in 
the start box the animal was standing on the 
odor from the choice point, and upon reaching 
the choice point, the odor was that of the 
old start box. In other words, only half of 
the total trail was used, that part which led 
from the start box to the choice point. per 
group also alternated left-right ере а 
Apparently in the absence of both a ves E 
bular map and untampered odor trail, there 
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is alternation using the response-induced cue 
of left-right directionality. 

Klein and Brown (1969) also investigated 
the role of olfaction in spontaneous alterna- 
tion. Half of their rats were rendered anosmic 
by ablation of the olfactory bulbs. All ani- 
mals received successive tests for alternation 
in a T-maze and for exploratory behavior in 
an open field maze. The intact animals alter- 
nated at a rate significantly greater than 
chance while the anosmic animals did not. 
While exploring the open field, the anos- 
mic animals were considerably more active 
throughout a session than were the intact 
ones. The anosmic rats did not show the 
habituation in open-field activity when tested 
on three successive days that the intact ani- 
mals showed. These three differences in pat- 
terns of exploration could all arise as a con- 
sequence of an anosmic animal being unable 
to recognize an area as being one that he had 
already explored. 

These studies of spontaneous alternation 
leave little doubt that odor is a critical cue 
in producing alternation behavior. The exact 
part played by odors is unclear, though the 
results suggest that the odor that the animals 
responded to is one which is left by the ani- 
mal, and which functions as a sign to guide 
future choices. Our understanding would be 
considerably improved if it were known how 
well rats can discriminate directionality in an 
old trail and how long the trail lasts. Further- 
more, it would be of interest to know if 
animals could recall facts about the local en- 
vironment based on cues from a specific seg- 
ment of the odor trail. Such abilities may 
not be unreasonable in macrosmatic animals. 
Finally, this phenomenon has many of the 
aspects of territorial marking and perhaps 
could be further explored in this context. 

Stress-generated odors. Several recent ex- 
periments have investigated an apparent odor 
that rodents generate when placed under con- 
ditions of imminent or actual physical stress. 
Tn a clear demonstration, Valenta and Rigby 
(1968) have shown that rats could use the 
change in odor created by electrically shock- 
ing another rat to serve as a discriminable 
stimulus. Food-deprived male rats were 
trained to press a lever for sucrose water solu- 
tion at a base-line reward schedule of Vari- 
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able Interval 1. The air that ventilated the 
Skinner box passed over a normal rat to pro- 
vide a standard olfactory background. Fifty 
milliliters of air drawn from the home cage 
of other odorant animals was injected into 
this ventilation stream. If the odor came from 
an unstressed rat, the availability of reward 
was increased by shifting the schedule to a 
Fixed Ratio 2 schedule; if the injected odor 
was from an animal that had just received an 
electric shock, the subject was subjected to 
lever-press-contingent footshock delivered on 
a Fixed Ratio 1 schedule. The dependent 
variable was the latency to the first lever 
press following injection of the odorized air. 
Each of the six animals showed a significantly 
longer latency when the air was from the 
stressed rat than when the air was from à 
nonstressed animal (means of 8.5 seconds for 
the stressed odor and 1.6 seconds for the 
nonstressed odor). Although no function was 
implied, this experiment adequately demon- 
strated the existence of at least one stress- 
generated odor that is detectable by con- 
specifics. 

Muller-Velten (1966) described the re- 
sponse of mice to the odor from other mice 
which had been handled roughly, a response 
he called an alarm reaction or angstgeruch- 
By an arrangement of one-way doors, 2 
mouse (both Mus mus musculus and Mus 
mus domesticus were used) housed in the test 
apparatus was forced to negotiate a long 
passageway from the living chamber to the 
food chamber. A portion of that passageway 
was divided to provide two parallel and vif* 
tually identical paths (choice arms) to the 
food chamber. A flow of compressed odoriZe 
air was introduced into the middle of one ? 
the choice arms and unodorized air into Ё 
middle of the other. Odor was produced P° 
passing the compressed air over the odora" 
subject or over other test substances. The 4! 
flowed against the direction of animal move’ 
ment and was exhausted from the chor 
point. The odorant animal was made fear 
by picking it up and blowing on it. In аб 
tion to testing odors from the whole gu 
the animal's urine, feces, and footpad se" 
tions were also tested. ЕГ 

Tests with nonfrightened odorant anim? 
resulted in the odorized choice arm being 9^ 
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d аа, When the odorant animal 
aroile 3 ened, the odorized pathway was 
esi eder but after adaptation by the 
The е А it became the preferred pathway. 
iti Ше es пе. was found to be contained 
Seseto ine and not the feces or the footpad 
and Кз tt Mas most effective for 8 hours 
ie ineffective by 24 hours. Urine 
duced nei rom the population as a whole pro- 
Y neither attraction nor aversion. 

hs Чен study detailing the use of olfaction 
the envi nunicate the existence of stress in 
(1969) ee has been conducted by King 
of light o ale albino rats received 35 pairings 
while re HA with an unavoidable footshock 
shüttle ip in the start half of a one-way 
specific ia Following specific delays and 
Bee He iege to the shuttle box, they 
the safe 5 i. times for escape latency with 
groups ro of the box accessible. Different 
tenden of subjects were tested for escape 
a T2 after delays from offset of training 
manipul; 6, 12, and 24 hours. The second 
ао ation. consisted of three different 
(a) ina conditions of the box during testing: 
ast а апа untouched since the subject 5 
after pe trial; (b) cleaned immediately 
hosing y e use by disassembling it, then 
air; or vith hot water, and drying with hot 
Used [o (c) cleaned immediately and later 
ا‎ Conditioning of another rat within 
11 ity immediately prior to the escape test. 
Shine s tested in a clean shuttle box 
delay ted extremely slow escape speeds at all 
tom Pili These times were not different 
NO es ne speeds that would have resulted if 
second; ape attempts Were made within 60 
group : (the default value). In contrast, the 
rat Eni in the presence of odor from а 
high es had recently been conditioned showed 
escape cape speeds at I-hour delay but their 
Not dus at all other delay intervals was 

he € from default base-line. 
Butane an which was tested while the box 
ing е the odor from their own jain 
unction ing produced a U-shaped escape-spee' 
Were E Speed at 1-, 12-, and 24-hour delays 
fresh o the same high magnitude as е 
Mase ]i ог group and were different from thg 
E hes observed in the clean boxes. Speeds 
~ and 6-hour delays were also similar to 


the fresh-odor group in that they were low 
and not different from base line. ` 

A second experiment examined escape 
speeds at long delays in the presence of ap- 
propriately old but unfamiliar training odors. 
The escape speeds were high and did not 
differ significantly from those of the equiva- 
lent group tested in the presence of their own 
old odor. King points out that the results 
obtained with appropriately aged odors dupli- 
cated the Kamin effect (Kamin, 1963), that 
is, “retention of a fear-motivated avoidance 
response is a U-shaped function of postcondi- 
tion delay [King, 1969, p. 594]." Inappro- 
priately fresh fear odor disrupted that effect 
at 12 and 24 hours, while an apparatus 
cleaned of odor sources from fear-conditioned 
animals did not elicit an escape response to 
the conditioned stimulus at all. 

These findings of King's seem to indicate 
the existence of a system for the olfactory 
marking of dangerous places. A more thorough 
understanding of the functional significance 
of this phenomenon would be enhanced by 
two additional types of experiments to test 
(a) the effect on avoidance performance 
caused by removal of а well-established arti- 
ficial training odor that had previously served 
to identify the training chamber and (5) ex- 
periments to sort out the nature of the con- 
ditioned-stimulus-odor pairing, for example, 
would a preconditioned flashing-light condi- 
tioned stimulus and the appropriately aged 
fear odor evoke escape response from an 
otherwise novel chamber? 

It is reasonable to believe that rodents can 
communicate specific information about stress 
bv olfaction. Valenta and Rigby (1968) indi- 
cated that stress induces an odor which is 
detectable by other rats. Muller-Valten 
(1966) demonstrated a withdrawal from the 
odor of a stressed conspecific. King (1969) 
appears to have found olfaction used in mark- 
he later recognition of dangerous 
At the present time, there is no 
data to indicate whether the same odor was 
being generated in all three experiments ОГ 
whether each experiment tapped one of @ ы 


lation of olfactory signals. 
"n ted odors. McHose and 


Frustration-gener@ ] 
Ludvigson 6) observed that animals 
tested late behaved in à 


ing and t 
locations. 


in a runway study 


28 E. FRED SCHULTZ AND JACK T. TAPP 


^ fashion appropriate to the reinforcement 
schedules of subjects that had been in the 
runway at an earlier time. This accidental 
finding prompted a follow-up study by Lud- 
vigson and Sytsma (1967). They composed 
a double-alternation running schedule in a 
runway for two groups of female rats. The 
first group was run in blocks so that all mem- 
bers consistently experienced a rewarded trial, 
or they consistently experienced an unre- 
warded trial. The second group was run in 
blocks so that half of all the preceding trials 
were rewarded and half were unrewarded. 
Speed of running in the middle 1-foot section 
of the goal box was reported. 

The experiment demonstrated that rats 
from the group that ran in consistently re- 
warded blocks or unrewarded blocks ran 
faster for rewarded trials (about 2 feet per 
second) and slower for nonrewarded trials 
(less than 1 foot per second). For the group 
of animals that as a group was neither con- 
sistently rewarded nor frustrated, there was 
no difference in speeds between rewarded and 
unrewarded trials. Furthermore, treatment 
differences showed up in extinction suggesting 
that training served to direct the animal's 
attention to stimuli that announced a change 
in the rules of the game. The group trained 
with reward of their predecessors consistent 
with their own reward state extinguished al- 
most immediately, whereas the other group 
persisted in maintaining high running speeds 
through all eight extinction trials. Both speed 
separation and extinction differences were 
reliable. 

Amsel, Hug, and Surridge (1969) have also 
examined the running speeds of rats in the 
presence of odors resulting from consistent 
reward or nonreward. Male albino rats were 
in а runway for one trial per day but were 
rewarded on alternate days. On any given 
day all animals received rewarded trials or 
all received nonrewarded trials, After 72 days 
the goal-box running speed of the subjects on 
оа days had fallen to less than 

t peed shown on rewarded days. 
The subjects responded faster on 81% of the 
rewarded trials during the last part of 
training. 

These studies have demonstrated the po- 
tential existence of a rat-generated cue that 


can communicate information to another rat 
regarding the presence or absence of reward 
while the animal is physically distant from 
the reward itself. The phenomena is startling, 
and the implication for learning studies in the 
runway are quite obvious. At this point it 
would appear that the animals were respond- 
ing to a chemical cue that accompanied frus- 
tration from nonreward. 

In an attempt to examine the frustration 
odor phenomenon in more detail Davis and 
Ludvigson (1969) trained 40 female rats 
with one trial per day for 30 days to run 
down a runway for 12 food pellets. The sub- 
jects were divided into four groups, created 
by applying one treatment from each of the 
following pairs: either reducing the reward 
to one food pellet to induce frustration or not 
changing the reward magnitude, and by either 
swabbing out the interior of the runway after 
every trial or cleaning the runway only prior 
to running the first subject from the group- 
Reward shift produced a decrement in run- 
ning speed, irrespective of whether the appa 
ratus was cleaned between subjects. This €x- 
periment failed to support Ludvigson’s earlier 
demonstration of a frustration odor cue which 
affected subsequent behavior. 

Seago, Ludvigson, and Remley (1970) 
rendered some of their rat subjects anosmic 
by removal of the olfactory buibs, then tested 
them in the double-alternation paradigm. The | 
intact animals showed the same patterning 
when run in squads as had been reported b. 
Ludvigson and Sytsma (1967), that is, fae" 
running on rewarded trials and slow running 
on nonrewarded trials, The anosmic rats "a" 
with uniform speed on all trials, though they 
could pattern their running speed when 
visual cue indicated the presence of rewar 
Intact rats that were run early showed ийе 
difference in speed between rewarded 2^ 
unrewarded trials. Animals run later were £45 
on rewarded trials and slow on nonrewatc : 
trials indicating that the cue was cumulative 
The speed of the anosmic animals run w 
the visual cue was not dependent 0” 
number of previous animals. This data | 
gests that an olfactory cue can communica 
the existence of reward or frustration. 

Wasserman and Jensen (1969) gave in 
male rats four trials per day for five AY 
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a runway for one 97-milligram food pellet. 
Ten of the animals were assigned to the odor- 
ош. group, half of them receiving two 

g trials per day (reward) and the 
other half receiving two extinction trials per 
day (frustration). The remaining animals 
were tested with four rewarded trials per day. 
The first trial each day was run on fresh 
paper, while the remaining three trials of the 
day were run on a counterbalanced sequence 
of fresh paper, paper used only by a rewarded 
odorant rat, or paper used by a frustrated 
odorant rat. ` 

The speed of rewarded animals that ran 
on flooring used by frustrated animals was 
significantly slower than when they ran on 
either new flooring or flooring used by re- 
warded animals. In this study slowing was 
induced in animals that were receiving con- 
sistent positive rewards. 

The experiments of Ludvigson and his 
Colleagues and of Amsel et al (1969) were 
investigating additional slowing in animals 
that were experiencing shifts in reinforcement 
Consistent with the hypothesized odor. This 
gives reason to believe that there may be an 
Odor of frustration but that (а) its effects 
{к not be completely additive with the 
кзы e reward shifts and/or (0) its effects 

у be slight in comparison with the effects 
9f reward shifts. 

, In still another investigation of the emo- 
lion-cued odor, Morrison and Ludvigson 
(1970) attempted to generate odors of both 
reward and frustration to be used as dis- 
Ctiminable stimuli in а T-maze. The rats 
Which served as odor sources Were adapted to 
Зер in the choice point of а T-maze. They 
aed the hypothesized cues when they 
ound food in the food cup (reward) OF failed 
to find it (frustration). А third cue condition 
bey was produced by replacing the paper 

he maze floor with fresh paper. 
м experimental animals Were pes 
a our subgroups. One group received 0 of 
the саи as а cue for turning into een 
аза ins. of the maze and odor of ios; M 
teas ue for an opposite turn. A second 8r = 
ho odor of reward as one cue and сеа Е 
vet et as the other. A third a i 
as сц odor of frustration and а clean floor 

ез. A fourth group received clean floor 


П е cues not produced by the 
experimental manipulations. Y 

Discrimination was achieved within 84 
trials by those two groups that had the odor 
of frustration as a cue, and was not achieved 
by the two groups that lacked the odor of 
frustration. Discrimination was not perfect 
but represented a steady increase over trials 
and was significantly different from the 50% 
average of the other two groups. 

It appears that a cue generated by non- 
reward exists and can be used as a discrim- 
inable stimulus. Odor generated by reward 
did not serve as a discriminable stimulus in 
this test. This conclusion is not only con- 
sistent with all of the other experiments in 
this area but is supported by the Wasserman 
and Jensen (1969) finding that running speed 
is slowed by frustration odor but no speed 
changes occur for the group with clean paper 
and the group with reward odor. 

In summary, there appears to be a dis- 
cernible cue generated by rats when their 
expectation of food reward is frustrated. The 
cue can affect motivated behavior by slowing 
running speeds to food reward even though 
the test subject never experiences the frustra- 
tion himself. The existence of a reward cue 
has not been supported to date. It appears 
highly probable that the frustration cue is 
an odor, but a clear demonstration of the 
olfactory properties of this cue has not yet 
been made. For example, it is possible that 
the cue being used is not a unique odor but 
is a quantitative difference in urination or 


defecation. 


Odors from Other Species 
The influence on the movement of rodents 
by odors from other species is another under- 
researched topic. Such odors could be used 
fensive or avoidance behaviors 


to mediate de ida jor 
or could function in symbiotic or parasitic 
life-styles. 
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in a glass container did not elicit the freezing, 
nor did a normal cat elicit freezing from rats 
Whose nostrils were plugged with cotton. 
Curti (1935) examined the same phenomenon 
on a population of wild and tame rats and 
mice which showed a smaller incidence of 
freezing under any circumstances than Grif- 
fith found. Curti's high variability and low 
number of subjects tended to obscure her re- 
sults, but the data suggested that a glass 
barrier was generally sufficient to prevent 
freezing. 

Reiff (1956) evaluated investigatory be- 
havior by rats and mice in response to urines 
collected on farms, in zoos, and in the labora- 
tory. Urine from domestic animals, in par- 
ticular, the dog, cat, pig, cow, and horse, were 
generally attractive when tested after 2 weeks 
of aging. The same was true of urine from 
other rats, Only one fresh urine tested proved 
to be attractive (from the puma). The mean- 
ing of these results in unclear at present. 

A third study which ingeniously investi- 
gated the response of rats to familiar and 
unfamiliar odors from nonconspecifics was 
done by McCall, Lester, and Dolan (1969). 
They examined exploration patterns of seven 
rats as influenced by the odor of two humans. 
Young female rats were divided between two 
caretakers who provided all maintenance for 
their own animals, including handling once 
à week for 11 weeks, The animals were then 
tested for side preference while in an open- 
field maze. Play objects and caretaker posi- 
tion were Systematically varied about the 
diagonal such that, for any particular subject, 
her own caretaker was standing outside one 
off-diagonal corner and the strange caretaker 
was standing outside the other. The animals 
Were tested twice, once with no lid on the 
open field, and then again using a transparent, 
airtight lid for halt of the subjects, and an 
opaque but olfactorally permeable lid for the 
other half. 

The animals spent Significantly more time 
on the side of their own caretaker under those 
conditions when they could smell him. More 
Preference was shown for the experimenter 
had worn the same unlaundered sweat shirt 
for the entire experiment than for the one 
who had worn a clean T-shirt every day 
(McCall, Lester, & Corter, 1969). Although 


the results could be explained by either a 
preference or an aversion hypothesis, they 
clearly demonstrate the existence of a be- 
havioral influence (and potential bias) which 
had previously been unrecognized. 

The work of McCall et al. (1969) on 
human odor influence, along with the work of 
Valenta and Rigby (1968), Miiller-Velten 
(1966), and King (1969) on odor and stress, 
and the work of the 12 investigators who 
have recently written about frustration and 
its odor, deserve special attention. Not only 
what was found, but also the fact that the 
phenomena were discovered at all, is im- 
portant for a number of reasons. First, all 
of the findings were unpredicted. That is to 
say, there exist no directly comparable phe- 
nomena in human olíactory experience, and 
аз à result, this use of odor by macrosmates 
was not widely anticipated, The fact that the 
phenomena seem reasonable in retrospect 
should provide no comíort since hindsight 
does little to expand the frontiers of knowl- 
edge. 

Second, and more important, these findings 
and others begin to suggest an extensive re- 
assessment of current experimental methodol- 
ogy for all behavioral research using animals. 
To say that odor exists as simply another cue 
to be controlled grossly understates the situ- 
ation. Odor removal techniques will have to 
be vastly improved from the current state 
of the art. Methodology will be further 
changed by the necessity for more compulsive 
techniques involving animal handling and 
running. Investigators will be justified in act- 
ing as though spirits were assisting the con- 
ventional senses of their subjects, 


FEEDING BEHAVIOR 


The regulation of movement in living space 
is also controlled by the animal's nutritional 
requirements, In this context there has been 
а small literature which has examined some 
of the details relating to the capacity of a^ 
odor to influence the rat's movements toward; 
or choice of, foods, 

There is a general acceptance of the prop 
sition that the olfactory system of macros 
matic animals differs from that of man n 
terms of (a) the range of odorants to which 
the system is responsive, and (5) the quanta 
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зға of sensitivity. In a demonstration of the 
extended range of olfactory sensitivity, Miller 
and Erickson (1966) have shown that rats 
could discriminate between pure water and 
water containing various chloride salts (so- 
dium, potassium, lithium, and ammonium) 
using an odor. Four rats (one for each salt) 
Were trained to sniff at two stimulus tubes 
and then to push through the one-way door 
"ea to the tube containing pure water 
АП жя 10-ѕесопа ассеѕѕ to a water reward. 
ейш" зна performed reliably at salt con- 
ions as low as .01 molar. Sectioning 
E olfactory nerves caudal to the cribiform 
е In one subject reduced his performance 
redis cud The olfactory sensitivity 
ЗЕН үсү in this study supports the as- 
bs 1 that rodents possess a range of olfac- 
capabilities unsuspected by humans. 
ee other frequently ignored dimension of 
actory capability in rodents is absolute 
DON sensitivity. Several investigators have 
ried to measure absolute threshold for pure 
ud odorants (Goff, 1961; Moulton, 
Sou]? however, a serendipitous finding by 
ih Ie and Long (1969) best serves to put 
ede ee the rat’s sensitivity to food 
dry 45 They found that the odor from one 
АЧЫ milligram Noyes food pellet provided 
ent cue value for correct T-maze per- 
Огтапсе, 
ari rats were trained to discrimin 
tessa тае from butyric acid in an en- 
айбы T-maze, modified to provide constant 
mene from the goal boxes through the choice 
Meu the start box. Concentration yes 
Dated s was reduced until they were BUE 
» Yet this did not disrupt correct per 
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in oe (a) The forced airflow eliminated 
the eo (and/or motivation) arising es 
an o , Centration gradient which surrount® 
Videq rant in still air; (5) the airflow pró: 
& mechanical gradient to which the sub 


ject could orient. If an airflow-trained subject 
had been tested under the no-airflow condi- 
tion and had been able to perform success- 
fully, havoc would have been wrought upon 
the whole literature of food-reward T maze 
studies. 


Approach Behavior 
| The research dealing with approach and 
feeding behavior is of two types: those studies 
that have probed the attraction that food 


odors have for rodents and those that have 
centered on preferences among food odors. 


Attraction to the Odor of Food 

Sveredenko (1954) investigated the ability 
of yellow-throated forest mice (Apodemus 
flavicollis) and voles (Clethrionomys glare- 
lus) to locate sunflower seeds and hazel nuts 
that had been buried in the ground. The ani- 
mals located seed to depths of 30 centimeters, 
but they reexcavated the path used by the 
experimenter in burying the seeds, indicating 
the potential use of other cues in addition to 
or instead of odor. The animals did not dig 
up empty burial sites, though they success- 
fully dug for nuts that were ground up to 
eliminate tactile cues. 

Howard, Palmateer, and Marsh (1969) 
have constructed a sophisticated apparatus 
that allows three odors to be presented simul- 
taneously in the study of preferences. The 
vered with 3 inches of pine shav- 
etal tart pans 
as a sensing 
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head for body capacitance relays and as ап 
odor site from which emanated compressed 
air. Odor was procured by passing the com- 
pressed air over jars of food before it was 
ducted to the sensing heads. Old odors were 
ed via diffusion through the hardware 
were run overnight for 15 
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other system for measuring the attractant 
properties of odors. It has the potential for 
measuring the amount of work (lever press- 
ing) animals will do to gain access to an odor. 
These studies have all employed methodol- 
ogies which reflect some dimension of “liking” 
for odors by rodents, but, in sum, the con- 
clusion is rather obvious, namely, rodents like 
food odors. 


Preferences among Different Food Odors 


Howard, Marsh, and Cole (1968) have 
examined food-odor preferences in a pro- 
cedure similar to that used by Sveredenko for 
examining attraction. Cereals and conifer 
seeds were buried in open petri dishes under 
5 centimeters of peat, and deer mice (Pero- 
myscus maniculatus) were tested overnight. 
Sugar pine and jeffrey pine seeds were 100% 
detected and eaten; oat groats coated with 
safflower oil showed less detection but were 
completely consumed. Barley showed a high 
rate of detection but low consumption, Empty 
petri dishes were uncovered 20%-50% of the 
time, reflecting the base-line response rate 
above which olfactory cues operated. Wheat 
and oats were detected at the same rate as 
the control dishes. 

In Reiff’s (1956) experiment described 
earlier, he avoided the consumption confound 
of the Howard et al. (1968) study by testing 
isolated odorants. These odorants were prob- 
ably novel to the subjects, but since they 
were extracts, essences, and oils from food, 
and organic acids and amines considered to 
be components of food, the response to them 
is relevant to feeding behavior. The conclu- 
sion that naturally occurring odorants were 
more highly preferred than the isolated com- 
ponents of the mixture is probably valid in 
spite of his methodological problems. 
ea рта ПЭН) ard age 
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ploration pattern, and the sequence in which 
each odor site was visited provided one mea- 
sure of odor preference. The relative quanti- 
ties of oats consumed in the presence of each 
odor was also measured. Each odorant that 
was visited first or had the most oats con- 
sumed in its presence received a scale value 
of 30. If an odorant ranked second, the value 
was 15; the third, 5; and if fourth, 2. Means 
of both the approach and consumption mea- 
sures were made when a substance was tested 
more than once, and a ranking was made of 
all substances for each measure. 

The part of Steinbrecher's experiment that 
is of interest here is that in which separate 
foodstuffs were compared. Since the rats ate 
the food that served as the odorant and since 
they were allowed to make repeated trials 
over the course of an evening, the effect ої 
food preference became confounded with odor 
preference. However. the approach ranking 
observed for the foods was as follows: pork 
fat, salad oil, edam cheese, liptauer cheese 
(all with maximum scores), camembert 
cheese, limburger cheese, seafish, smoked fish, 
milk, meat, smoked herring, butter, eggs: 
carrots (which obtained the mean score); 
bread, salmon, mainzer cheese, oats (the con- 
trol standard), potatoes, bacon, romadut 
cheese, tomatoes, roquefort cheese, and hot 
cake crumbs with vanilla. No generalities can 
be extracted from this experiment. "m 

At this point there are numerous question® 
left to examine with respect to relationship" 
between food odors and attraction. None = 
these experiments suggest any general sa 
vant dimensions of food-odor attractive? 
nor do these studies provide insight - 
issues such as the role of familiarity ve 
wired-in predispositions in generating ri 
ence, or the origin and function of wire it- 
food odor preferences, Other potentially ЇТЇ ', 
ful lines of inquiry might investigate 
relationship between odor preference 
biological state, such as pregnancy; Tap 
prehibernation fatness, posthibernation ^ о 
ger, etc. More research is also necessary ys 
delineate the relationships between food 0 
and nutritional state. 
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havior. In the first of these, some quantitative 
or qualitative aspect of the odor is aversive. 
and the animal shows spontaneous with- 
drawal. In the second, aversion is learned as 
in bait shyness. 


Spontancous Withdrawal 


Barnett and Spencer (1953) evaluated the 
abilities of some odors and flavors to repel 
rats from grain stores. In a captured colony 
of wild brown rats (Rattus norvegicus) which 
normally ate wheat, odors were either mixed 
With the wheat or put in tins and inserted 
Into the wheat boxes. Aniseed odor had no 
effect on consumption but aniseed oil mixed 
2.5% with the wheat caused avoidance 
When unflavored wheat was available. When 
n-butyric acid was mixed with arachis oil and 
Compared against wheat mixed only with 
arachis oi], butyric acid significantly lowered 
Consumption. The odor of n-butyl mercaptan 
alone lowered consumption of whole wheat, 
as did peppermint oil. These experiments had 
to be terminated before a full range of odor- 
ants could be explored systematically. Little 
Can be concluded from the data that were 
Published. 

Millman (1968) found aversion to odors 
While seeking preferences. Compounds serving 
4S odorants were placed in pure form in a 
glass collar surrounding a lick tube which 
Contained sucrose solutions; thus, the air in 
the immediate vicinity of the rat’s mouth and 
Nose was very concentrated with odors. A 
Collar around another tube contained one of 
Several less odorous substances chosen to 
Visually resemble the odorant. The odorants 
Were chosen as representative of each of 

moore's (1952) seven categories. 

lost odorants (i.e, menthone (minty), 
d-camphor (camphoraceous), formic acid 
(pungent), ethylene dichloride (ethereal), 
and butyric acid (rancid) ) caused signifi- 
Cantly less than half of the total consumption 
Of sucrose solution to be drunk from the odor- 
ized tube, None of the odorants enhanced 
Consumption. There are no conclusions to be 
drawn from these studies at present. Mill- 
Man’s technique for measuring aversions is 
teresting, though it is obviously necessary 
© Control for concentrations in future studies 
of this sort, 


Learned Withdrawal 


Bait shyness, or toxiphobia, functions to 
protect organisms from poisoning, and repre- 
sents a conditioned aversion to specific food. 
To experimentally produce the effect, the 
ingestion of a food which is novel in some 
dimension is followed by a sublethal dose of 
a poison. Pain and Booth (1968) adapted 
subjects to daily presentation of 20% solution 
of d-glucose in addition to their ad-lib water 
and chow. On the first experimental day, the 
glucose solution was odorized (and potentially 
flavored) by the addition of 10 “1. of laven- 
der oil per 100 millileters of glucose solution. 
Half of the rats received a sublethal in- 
traperitoneal injection of cyclophosphamide 
following drinking. The presentation of the 
glucose solution was continued, but the pres- 
ence of odor was alternated by days. 

Consumption of nonodorized days was sim- 
ilar to that observed prior to the injection 
phase of the study. During the first two 
odorized presentations, glucose consumption 
fell by approximately 50% for poisoned ani- 
mals while saline injected controls showed 
no significant pattern, Following 9 days of 
extinction and 10 additional days when no 
glucose was presented, the same experiment 
was run on the same animals beginning with 
5 days of habituation. The saline controls 
received the poison, in a slightly larger dos- 
age, and the odorant used was citral. The 
results were essentially the same as those 
observed on the first test. If it can be assumed 
that the odorants did not flavor the solutions, 
then Pain and Booth satisfactorily demon- 
strated that an olfactory cue can mediate the 
toxiphobic response. 

Little attention has been given to the ca- 
pacities of odors to induce withdrawal from 
food or other objects. The olfactory compo- 
nent in neophobia, which wild rats show to 
all new objects in their environment, also 
deserves detailed study. 

The Steinbrecher (1962) study on response 
to food odors deserves consideration apart 
from its study of attraction to and repulsion 
from food odors because of the number of 
substances tested. Using the elevated cross- 
maze with the odorized feeding stations de- 
scribed earlier, a very large number of odor- 
ی ا‎ рт чаа ае. 
foods, commercial scents, extracts, organic 
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RANK ODOR PREFERENCE MEASURED BY QUANTITY OF CONSUMPTION 


Fic. 1. Ranking of odorants on two preference sc: 
neutral values obtained witho 


acids, aromatic hydrocarbons, and esters, An 


unodorized feeding station was always pres- 
ent. Scales reflecting sequence of approach 
and the quantity of food consumed were com- 
puted. The relative noninterval position of 
each odorant on both scales is shown in Fig- 
ure 1. This presentation allows а convenient 
summary of Steinbrecher’s results, 
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presence, while other odors, including methyl 
salicylate, musk ketone, and oil of bitter а 
monds, were visited well after the unûdorz 
site but elicited much consumption. Con 
trated butyric acid always received late 
but elicited the same consumption as t n 
odorless site. Different responses were m 
to natural odorants and their synthetic coun i 
erparts. Real civet was consistently ар 
proached first, and synthetic civet last, b" 
their consumption measures were the same 
Oil of bitter almonds elicited much consump” 
tion, but the synthetic produce saw little 
sumption. Despite the exceptions, ا‎ 
brecher's measures rank correlate .56 puse 
themselves, Unfortunately, no conclusie е 
other than the empirical description are for 
coming from the study. 


RODENT SOCIETIES 
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found that wild rats have a dynamic social 
culture that is influenced by the interaction 
of a large number of factors. The social status 
of any one individual is some complex func- 
ton of living site, nearness to food, age, 
absolute physical size, rate of growth, social 
rank of the mother, social rank of the mother's 
colony, age of other animals associated with 
while a youth, sex ratio of the home colony, 
and success at fighting. The influence of this 
Social status is seen in many facets of rat 
life, including style of walking, time of feed- 
Ing, selection of mates, and reproductive suc- 
cess. Calhoun observed marking, rubbing, and 
sniffing behavior associated with courtship 
and reproduction, which strongly suggested 
that odor was involved. He did not describe 
a role for odor in other kinds of behavior, 
and because experimental manipulations dis- 
Tupted the colony, he did not attempt to 
extensively investigate the effects of odor. 
Other researchers with a more experimental 
bent have directly investigated odor and 
Social behavior and have found a variety of 
Interesting interactions. Two studies indicate 
that varying degrees of social traits influence 
the odors given off by an animal. Several 
Other studies suggest that the reverse is also 
true—odors influence various dimensions of 
Social behavior. 


Influence of Social Traits on Odor 


Two experiments suggest that the odors of 
Tats and mice are responded to differently if 
the odorant animals differ from one another 
along the dimensions of dominance or aggres- 
Sion. To examine these relationships, Carr 
and his associates artificially induced domi- 
nance or aggression in the odorant animals. 
Odor stimuli for subsequent tests were created 
Аас. odorant animals into a cardboard 
bf үш: with a sawdust floor. The ends 

^e cylinders were closed by permeable 
Partitions, and at one end of each cylinder, 
“ee was recessed to form an atrium 
a dead-air space. Approach behavior was 
“cored when the subject’s head entered any 

Portion of the atrium. 
сат, Martorano, and Krames (1970) 
conned a group of CS7BL/6] mice to be ex- 
ated re savage. Following 6 weeks of iso- 
om iving, naive subjects were placed in the 
* cages of the trained fighters and each 


subject was attacked and defeated 10 times. 
Subjects from another group were exposed to 
the fighters but were separated from them 
by a wire screen, while subjects from still 
another group were placed in fighter cages 
behind a wire screen while the fighter was not 
present. АП subjects were then tested in the 
vacated home cage of a fighter mouse that 
contained two test cylinders, one smelling of 
the fighter mouse that lived in that cage 
and the other smelling of an unfamiliar de- 
feated mouse. For subjects from one group, 
the fighter odor was generated by the same 
fighter mice that had defeated them, but for 
the remaining three groups, the fighter was 
{тот unfamiliar fighters. 

Results indicated that all mice spent sig- 
nificantly more time investigating the fighter 
odor than victim odor unless the subject had 
been defeated by the particular fighter, In 
the latter instance, the victim received con- 
siderably more exploration time than the 
fighter. These results provided strong evi- 
dence for the existence of an odor complex 
associated with aggressiveness, and they sug- 
gest an interaction between the nature of the 
odor-complex and the experience of the indi- 
vidual animal in determining the particular 
behavior pattern observed. 

Krames, Carr, and Bergman (1969) ob- 
served somewhat different results in a study 
with rats. A stable dominance hierarchy was 
established in four groups of rats by provid- 
ing limited access to food. All subjects were 
then tested in their own home cage to deter- 
mine the amount of time spent investigating 
two odor cylinders smelling of the most 
dominant rat from an unfamiliar group or 
smelling of the least dominant rat from the 
same unfamiliar group. The odor from the 


least dominant rat received the most in- 
vestigation. 
Both studies demonstrate that rats and 


mice respond differentially to the body odors 
of conspecifics when the odors involved differ 
only in the dominance role of the source ani- 
mal. Thus it seems that odor is at least one 
cue used to communicate social status. 


Influence of Odor on Social Dimensions 


Odor has also been shown to affect aggres- 
sive behavior and to affect behavior related 
to group cohesion. 
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Influence oj Odor on the Dimension of 
Aggression 


Ropartz (1968) paired off members of a 
colony of Swiss albino mice and selected 
those which were judged most aggressive. 
Those chosen were again paired off with one 
another for a second round of encounters 
after each had been sprayed with a commer- 
cial perfume. Eight days later the same ani- 
mals had their olfactory bulbs removed and 
were again paired off. The initial mean 
latency to attack was 35 seconds. Masking 
odors with the perfume raised the latency to 
199 seconds, and aíter bulb ablations no 
attacks occurred. The mean number of attacks 
per animal fell from 26 to 9 to 0. Bulbecto- 
mized mice uniformly submitted to attacks 


by intact mice despite their former aggressive 
nature. 


clean cages, 
but signifi- 
transferred 
d by pairs 
rent strain, 


1966) reported 
у gression in pairs 
of mice housed together was produced by 
ur levels of un- 
m Group 1 were 
On with a cotton 
the home cage. 


Group 4 than for Group 3. The odor of a 
strange mouse added to a familiar mouse in- 
creased aggression while odor of a familiar 
mouse added to a strange mouse reduced ag- 
gression. In all but Group 1, the aggression of 
the home member far exceeded the aggression 
of the experimental member. р 

A similar experiment was reported simul- 
taneously in which the perineal regions of 
both pair members were rubbed, using urine- 
soaked cotton from strange mice instead. of 
themselves, and resulted in the same high 
levels of aggression. 

When taken together, these studies suggest 
that the olfactory complex contributes to the 
elicitation of aggressive behavior in rodents. 
More specifically, the more novel an odor, 
be it the odor of the test environment or the 
odor of a conspecific, the greater the degree 
of aggressive expression. Such a mechanism 
could have the following two effects on the 
regulation of the rodent society: (a) It could 
minimize infighting in an established me 
archy and (b) it could prevent intrusion by 


a novel animal into a well-established colony 
hierarchy, 


Influence of Odor on Group Cohesion 


Odor can facilitate group cohesion by se 
ing as the mediator of individual attraction 
Le Magnen (1952) reported that a sexually 
naive male Wistar albino rat would appro 
another male, a female in diestrus, or à gor 
in estrus rather than approach a place Jack 
ing animal odor. These tests were conduct 
in a small closed metal T-maze with e 
Stimulus animals housed in the ends of the 
Cross arms and separated from the goal d 
by a louvered metal partition. Air was aet 
through the stimulus holders, into the ^ 
boxes, and then through the choice point 2 
to the start box, at 

Muller-Velten (1966) has also found t^ 
mice captured from a wild interbreeding ро, 
ulation of Mus mus musculus and Mus e 
domesticus chose to use the passageway et 


кл 1 nile 
taining the odor of unstressed mice W the 


exploring or seeking food rather tha" e 
unodorized passageway. Fresh group mug 
did not have attractive properties, altho 
males were attracted to female urine. P 


" 
fe х о 2 
ces, even those released in fear, als nd 
tracted. These 


Studies by Le Magne? 


КУ 
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Muller-Velten suggest that some component of 
the body odor of rats and mice attracts con- 
Specifics when the alternative behavior is 
restricted and that there is some suggestion 
of differential sex effects. 

An olfactory influence in social cohesion 
has been found in a series of studies by Marr 
and others on the effects of early experience. 
The general procedure consists of odorizing 
a nesting litter for a specific interval then 
testing the young at a later date for prefer- 
ences between an unodorized animal, an ani- 
mal Scented with a novel odorant, and an 
animal scented with odorant previously ap- 
plied to the subject. An open Y-maze with 
Compartments for the stimulus animals was 
the test apparatus, 

Marr and Gardner (1965) rubbed different 
Tat litters with cloths containing methyl salic- 
ylate, Yardley’s Red Roses Cologne, and for 
Control, odors picked up by a cloth kept on 
mie bottom of the home cage. Mothers were 
Oh ed prior to birth and pups were rubbed 

Се daily from Day 2 until Day 30. In 
RM, the animals preferred the odor of 
time 6 spending about 28% of their total 
iuo any of the stimulus animals and 
in 3% of the time near the animal smell- 

8 of the odor of rearing, 

bes and Gardner also examined the in- 
Mating of early olfactory experience on the 
grou 5 and breeding behavior of the three 
ое Breeding tests with members of the 
betwe treatment group showed no difference 
and in those animals that had been Lupe 
Male in either the amount of time the 
to тозе the female or in the latency 
males st mount. However, the once-scented 
hajp „ NOUnted their once-scented mates only 
as often as did the subjects from the 
cented group, 
on Second test of early olfactory experience 
anq In preference was reported by Marr 
Urin iston (1969), Novel scent was applied 
birth 8 the first, second, or third week after 
Atetopp odorants were used and tested— 
ten none and ethyl benzoate. Subjects 


Dong 


; “Тај edd й 
far] s Significantly longer near the famil- 
Sento j MEd stimulus animal than the un- 


ject © Or novel-scented animal. Those sub- 

Brea Scented during the first week showed 

Scent, Odor preference than did the subjects 
ed later, 


Carter and Marr (1970) followed up this 
indication of a critical period influencing ol- 
factory preference with Guinea pigs (Cavia 
porcellus) that were scented with aceto- 
phenone and with ethyl benzoate for 3 weeks. 
When tested for preference at Day 14 and 
Day 22 the animals showed preferences for 
the odor of rearing. Subjects scented for the 
first 2 weeks and tested a week later still 
showed the same preference. Subjects scented 
for periods of 3 and 5 days during the first 
3 weeks and tested during the second, third, 
and seventh week failed to show preference 
for the familiarly scented stimulus animal 
and chose instead the unscented one. Com- 
parisons of preference values across groups 
revealed that increased exposure yielded a 
monotonically increasing preference for the 
experimental scent and a montonically de- 
creasing preference for the naturally smelling 
animal. To the extent that artificial odoriza- 
tion successfully models the development of 
natural odor preference, natural preference 
appears to be a function of exposure length. 
At this time, there is no evidence for a critical 
period during which exposure must occur 
for preference to develop. 

Schapiro and Salas (1970) have reported 
that the odor of the mother rat will inhibit 
activity in infant rats. They suggest that this 
may reflect an imprinting phenomena related 
to individual recognition. Though the sugges- 
tion is well made, their experiments fail to 
provide a test of this hypothesis since they 
tested only the odor from the mother of the 
test infant. 

For rodents, olfaction is a sensory modality 
with a potentially powerful influence on a 
variety of social behaviors. Since some of 
these odors may be produced by glands, since 
they effect behavior and may be odorless to 
man, it is tempting to view them as phero- 
mones. A note of caution is in order. There do 
appear to exist mammalian pheromones which 
influence the physiological functions of some 
species (primer pheromones), specifically the 
reproductive behavior of some mice (Bruce 
& Parkes, 1960; Dominic, 1966; Lee & Boot, 
1956: Whitten, 1966; Whitten, Bronson, & 
Greenstein, 1968). On the other hand, there 


has been a conspicuous lack of evidence for 


a mammalian releaser pheromone. No im- 


mediate response to airborne chemicals has 
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been reported that is not readily explained 
by conventional learning and olíaction. To 
suggest at this point that some mammalian 
social behaviors are the result of pheromones, 
rather than to other attributes of odors, is 
to invoke an additional complex conceptual 
framework in the absence of need and in the 
face of an adequate and far more parsimoni- 
ous explanation. 


FIXED PATTERNS AND HEREDITY 


Were releaser pheromones to exist for mam- 
mals, they would most likely regulate fixed- 
action patterns or species-specific behaviors, 
that is, the specific movement patterns that 
are exhibited by animals, which take on a 
stereotypical form. These behaviors consist 
of a complex pattern of movements that are 
shared by the majority of the members of a 
Species and are regulated by complex sets of 
stimuli. Only a small part of the literature 
on rodents has investigated the roles of odor 
as critical stimuli in the elicitation and main- 
tenance of such behavior patterns. 

The literature on odor and fixed-action 
patterns has dealt almost exclusively with the 
role of odors in sexual behavior. A major 
exception can be found in a Series of experi- 
ments conducted by Beach and Jaynes 
(1956a, 1956b, 1956c) in which they investi- 
gated maternal retrieving behavior. Ablations 
of the olfactory bulbs resulted in indiscrimi- 
nate retrieval of both the mother's own off- 
Spring and of strange rat pups. The olfactory 
bulbs, and thus, presumably, the sense of 
smell, were not necessary for 


at pups. The study 
970) Which shows 
In the presence of 


borative, The area 
on, 


rather large number of 
Sex and odor 


E. FRED SCHULTZ AND JACK T. TAPP 


sex odors. These factors have included ш 
sex of the subject, the subject’s norm 
readiness for copulation, the hormonal На. 
ness for copulation in the odorant Mer 
previous sexual experience on the part S ed 
subject, and the familiarity of the su "d 
with the test animals. Discrepant findings e 
this literature indicate that there may be $ E: 
other factors that affect the measuremen ч 
sex-odor preferences including potential di 2 
ences in response caused by odors ak ү 
generated from different parts of ше 
and the ever-present difference in sensit! 
among measurement techniques. «ents 
The protocol on most of these ip dae. 
is similar. A subject is typically pne 
with a two-choice preference task. The 0 or- 
that are used as stimuli come from two rated 
ant animals in different physiological 5 M. 
Characteristically the odorant animals е: 
have been a receptive female, а a an 
female, an intact male, a castrated ma ©, к 
a clean-air control in which no anime ani 
presented as a subject. The subjects хреїї“ 
selves differ in sex, in degree of sexual P here 
ence, and in hormonal state. As a result, sior 
are 80 possible nonredundant combini р 
made up of one subject animal and two mati 
ant animals. Table 1 consists of a syst mary 
outline of those combinations and a SU! 
of these experiments, imen 
There are differences between expe? nts: 
in the mode of generation of the О тш 
Several of the experiments presented 5 anil 
odors by placing the entire odorant (cat 
or its excrement in a porous pei , 
Loeb, & Dissinger, 1965; Carr, 100070). 
Wylie, 1966; Carr, Wylie, & Loeb, 
Scott and Pfaff (1970) used the odor 
in test tubes, Pfaff and Pfaffman (29 É 
glass bottles containing urine as ОЧОР 5 (grt 
Stern (1970) used cardboard boxes б 
tain the stimulus animals at the or d 
ing. Le Magnen (1953), Mil a os 
(1966), and Pranzarone (1969) blew ^ ap 
the odorant animals. Brill (1967) 05 T 
flooring which had been walked over’ NM 
en nu 
There are also differences betwe en f de 
in the Measurements employed. sever в 0 
Studies included im Table 1 шей ^. a 
Pendent measure the amount of i? 196" 
investigating the stimulus (Carr, et "is pi 
Carr, et а], 1966. Carr, et al., 1? 
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TABLE 1 
Sex ODOR PREFERENCES SHOWN ny MALE AND FE ALE SUBJECTS 
Hormonal state preferred odorant animal 
——— AS аса = 
Prepared to copulate Not prepared to copulateb 
Sexual status of 
о10пргеѓегтед N | | 1 
dorant animal Receptive | Nonre- | Intact | Сах No | Receptive | Nonre- | Cas- 
FETA ceptive tact | trated | > еее | ceptive Intact ste No 
jemals female | "ale | male | edor female female male pos oder 
Male naive subject 
Receptive female 2 | Е 
гета 2), (10) a 2), (5) (7 5) 
Nonreceptive female | (2), 5, 8, it | (5) (2), (5), о E) (3) 
(10) (7) 
Intact male 5 (5) (4) 1 (5) * (5) a) 
oie! male (4) (4) 
TRE (0, 5 (0), 5 5 (7) 0) 
Male experienced subject 
Receptive fe Г 1 | | 
tive female (8) | (3) (10) | 
“onreceptive female | 2, 3, 7, (8), 0) | (3) (10) | 
Intact ma 4 
: ale 4 | 
Aistrated male (4) ш (4) = 
Wedor 6,7 6, (7) 
Female naive subject 
Recepti 
Dtive female 4 1 | 
N 3 lemale 1 | (4) 1 
тг CEDLiVE female а) | (4) (5) m 
Cast; ale a) (5) (2), (9 | 0) 
No gated male 2,0 (2), (5), 9)" 
áo (1) (1) [0] [e (1) 
Female experienced subject 
789 
Recepti TET d 
Non live female 4 4 
Inentecentive female | (4) S w n 
Cast Male (2) 
No ane’ male (2) 2 


indicated by an open number, indiffere 
oeb, & Dissinger, 1965: 
Loeb, 1970—Long: 


06.1 

rats; qoy U mus (7) Pialf & Pfafíman, 1969—Sprague- 
? Map, Scott & Pfaff, 1970— on laboratory mice; (10) S 
> Male Die gonadal! 


? were castrated. 
& 
айтап, 1969; Pranzarone, 1969; Scott 


е 1970; Stern, 1970). Other experi- 
ters used a dual passageway which was 


Oday: 

ээ With body odor and recorded the 

1952 Way chosen (Brill, 1967; Le Magnen, 
, 


üller-Velten, 1966). 

© results that are summarized in Table 1 
10е evidence of order on a large scale. 
Preferg 34 combinations of variables tested, 
combinar was found in only 13 of these 
Only анаа by at least one investigator. 
Yieldeq o cm combinations, however, have 
by Systematic preference when examined 
оп] Ore than one investigator, and likewise 
WO combinations have yielded syste- 


Offer 
Of 


Long-Evans hooded rats; (3) Carr, Loeb, 
ans hooded rats; (5) LeMagnen, 1952— 


act. Females were in natural or drug-induced u 
males were in natural diestrus or were ovariectomized. 


ence by a number in parenthesi: 


umber key: (1) Brill 1967— 
e, 1966—Long-Evans hooded 
ter albino rats; (6) Müller-Velten, 
(8) Franzarone, 1969—Long-Evans hooded 
Dawley albino rats, 

strus, 


awley albino rat 
n, 1970—Spragu 


matic indifference. The remaining combina- 
tions receiving multiple test have produced 
conflicting results. If all of the data are col- 
lapsed and the total pattern presented in 
Table 1 is examined, it appears that the odor 
of a receptive female is preferred to the odor 
of a nonreceptive female, which in turn is 
preferred to the odor of an intact male, which 
is preferred to the odor of a castrated male. 
All of these stimuli are preferred to no odor. 
It should be borne in mind that such a global 
ranking is made in the absence of much data 
and is therefore highly tentative. 

On a smaller scale, conclusions that have 
been made by Carr et al. (1970) warrant 
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closer investigation. For male rats both hor- 
monal readiness for copulation and prior 
sexual experience were jointly necessary in 
order to observe a preference for estral odor. 
In contrast both Le Magnen (1952) and 
Pranzarone (1969) found naive male rats 
preferring odors from estral females. | 

Carr et al. (1970) suggest also that either 
hormonal readiness for copulation or prior 
sexual experience were independently suffi- 
cient to elicit the preferences shown by female 
rats for the odor of intact males over that 
from castrates. In contrast, Scott and Pfaff 
(1970) have found that naive, diestral fe- 
male mice showed significant preferences for 
odors from intact males. When Carr et al. 
(1965) tested such sexually experienced fe- 
male rats, they failed to find preference for 
odors from intact males. It would thus appear 
that hard and fast rules concerning odor ргеї- 
erence, sexual experience, and hormonal readi- 
ness are somewhat premature. 

The individual findings and discrepancies 
are more enlightening than the collective re- 
sults. The experiments by Brill (1967) seem 
to constitute consistent exceptions to the data 
of Le Magnen (1952) suggesting that the 
odor of odor trails laid down in a T maze 
are responded to differently than odors from 
the total body or odor from excrement. 

Measurement differences also may account 
for some of the discrepant findings outlined 
in Table 1. For example, Pfaff and Pfaffman 
(1969) found that if the measure of investi- 
gation was the amount of time spent sniffing 
at the opening of a bottle which contained 


the odorant material, then 
showed no 


odors from г 
Tf, on the ot 


of the glass bottle o 
sniffed, then a di 
preference emerged ranzar 

(1969), Stern ( d. Lem 

Я 5 

Ch assessed 
mal spent at 
Der, resulting 


. ) results also provid i 
"es € an in- 
ght that may account for some incon; 
tencies, Н nud 


is experi 
5 experiment showed that in order 
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{ог male rats to show preference for the odor 
of estral rather than diestral female rats, 
sexual experience with females is necessa и 
The required sexual experience did not Бау 
to result in ejaculation. Sexual -—— 
could be stopped aíter intromission ph АГ 
mounting and the preference would stil E. 
exhibited, although exposure to receptive, T 
males through a wire screen was insufficie 
to allow the generation of preference. 
Carr, Krames, and Costanzo (1970) li- 
a series of experiments which further eum. 
cates the ability to specify the dimensions " 
an adequate sexual experience in develop? E 
of preference for sex odor. The repons a 
rats tested on preferences for odors from n cd 
versus familiar sex partners was PE e 
by at least four variables: the sex © n 
subject, the number of previous sex абз a 
current level of sexual satiation, and чара 
the history of the animal’s sexual expe S 
was monogamous or polygamous. Polyg? time 
males with much experience spent more a 
examining odors from novel than from nsivê 
iar partners while female rats with а 
polygamous experience showed indiffe A 
with respect to the dimension of novelty rog 
sus familiarity. Males with extensive i t 
amous sexual experience failed to a nt 
odor preferences that their polygamous 
erparts had exhibited. . t the 
Sexual satiation was also varied M a 
time of testing by placing all subjects stint 


report 


cooperative partner shortly before yed f 
Half of these subjects were rem pile p 
testing after one ejaculation Е test” 
other half copulated to satiation "i 79 
ing. The number of ejaculations fue? 
medi did not HI , ра 
mediately preceded the test di 


that ^uid 
the odor preferences for those males © ur di 


extensive previous sexual experience тета, 
it effect the indifference shown s^ st 
(all polygamous). When a ge re 
that had no sexual contact prior to Ur. rete 

Copulation was tested, the amount M t 
experience did influence preferent m 05 
formerly naive males that had СОР eter yy 
Satiation showed the normal odor еня рае 
Characteristic of males of extens ей "i 
gamous experience, that is, they рге nose И 
novel odor, However, those males M 

sexual experience was a single pre! 


ce 
s й ге 
Hon did not show differential pref! 


p! 
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Some limited conclusions can be generated 
from these observations: (a) Male rats seem 
to show preferences for sexual odors under 
more circumstances than do females. Refer- 
ence to Table 1 shows that, for males, prefer- 
ences were found in 3 of the 19 combinations 
that were tested, whereas for females those 
preferences were found in only 5 of the 15 
combinations. If the summary is restricted to 
subjects that were hormonally prepared for 
copulation, then 8 of the 11 combinations 
presented to male subjects elicited preference, 
while only 3 of 7 combinations tested with 
females elicited preference. It has been gen- 
erally speculated that the preference dis- 
crepancy between sexes reflects different roles 
In mating behavior, but such a suggestion 
remains to be supported. (0) Odors generated 
by different parts of the body are responded 
to in different fashions. This is suggested by 
the contradiction between Brill's (1967) odor 
trail study and Le Magnen's (1952) body 
odor study, More direct tests of this hypothe- 
Sis would seem to be called for. (c) Different 
Measurement techniques possess different sen- 
Sitivities. (d) It would appear that sexual 
experience does contribute substantially to 
Preferences involving sex odors but the spe- 


Cifics of this interaction are not currently 
Understood, 


The Role of Odor in Sexual Intercourse 


The function of odors in sexual behavior 
has been examined by eliminating the influ- 
ence of odors via olfactory bulb ablations. 

1 of these experiments have been conducted 
On male animals, and all but one have used 
Tats. The change in the behavior following 
ablations has been examined in both naive 
and experienced males. 

n rats, bulb ablations do not eliminatae 
Sexual intercourse (Beach, 1942; Bermant & 

aylor, 1960; Heimer & Larsson, 1967). For 
oe experienced males, bulb ablations do 
„ Prolong the latency to approach a recep- 
Ive female, mount her, or to achieve intromis- 
Sion (Bermant & Taylor, 1969; Heimer & 
arsson, 1967), Bulb ablations in naive ani- 

als, however, were shown to prolong latency 
О intromission (Bermant & Taylor, 1969). 
“Nee intromission was achieved, bulb abla- 
ae generally prolonged the interval until 
Jaculation in both the experienced and naive 


animals (Bermant & Taylor, 1969; Heimer 
& Larsson, 1967). 

The overall slowing effect of bulbectomy 
has also been reflected in the number of ejacu- 
lations achieved during a relatively short 
testing session. The number of ejaculations 
necessary to achieve satiation, however, is 
not affected by bulbectomies. 

All of this evidence suggests very strongly 
that for rats the ability to smell is not neces- 
sary to allow sexual behavior, but it does 
produce an effect through the interaction of 
prior experience. 

In contrast to findings in rats, Murphy 
and Schneider (1969) found that male golden 
hamsters with bulb ablations did not copulate 
at all. They engaged in no social behavior 
with other hamsters except a minimum suffi- 
cient to remove them from disturbing effects 
of others. As a consequence, bulbectomized 
male hamsters initiated no courtship or mat- 
ing behavior. 

The clear-cut discrepancy between re- 
sponses across rodent species precludes any 
global conclusion regarding the role of odors 
in effecting sexual intercourse. Odors appear 
to have some exciting, directing, and main- 
taining effect on male rats, and odors appear 
to be entirely necessary for the male hamsters 
to engage in sexual intercourse. It is clear 
that odors play a complex role in reproductive 
behavior in rodents. 

The simplistic approach adopted by most 
of the investigators discussed here stands out 
in sharp contrast to the complex sex-oriented 
behaviors described by Calhoun (1962a). 
Observations made of wild rat colonies in a 
quarter-acre compound revealed that the ap- 
pearance of a female in estrus caused a large 
number of behavioral changes. Packs of males 
followed her, sniffing and attempting to 
mount. She was not allowed to feed or drink 
in peace. If she attempted to escape into her 
territory or burrow, they would ignore terri- 
torial boundaries to follow her. They would 
station themselves at all entrances to her 
burrow and even follow her into it if she 
remained too long. Aggression by unsatiated 
males was displaced toward other males en- 
countered elsewhere in the compound. In 
colonies dominated by a single male, that 
male protected the female from most of her 
but for females from colonies with 


suitors; 
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multiple males, and therefore, with no pro- 
tection, the number of mountings per night 
was estimated to approach a thousand. W hen 
mounting was allowed, it followed a rigid 
series of stereotyped steps, that is, fixed- 
action patterns. The large number of new 
behaviors and shifts in old behaviors that 
occur when a female is in estrus offer many 
possibilities for the interplay of odor with 
sexual behavior. Further research is certainly 
warranted to elaborate the eliciting and main- 
taining roles of odor in such behavior. 


CURRENT STATUS AND FUTURE DIRECTION 


No consistent generalizations have emerged 
that allow a convenient summary of relation- 
ships between olfaction and rodent behaviors. 
Perhaps this is because research on olfaction 
and behavior suffers under two very serious 
constraints. The major constraint is our ig- 
norance of the qualitative dimension of odor. 
At this time, investigators cannot describe 
odors in terms of relevant stimulus param- 
eters and it is, therefore, impossible to per- 
form experiments that require knowledge of 
the exact differences between olfactory stim- 
uli. In the face of this ignorance, a reasonable 
alternative strategy for research on the role 
of olfactory stimuli and animal behavior is 
to choose odors for investigation that are 
biologically meaningful, that is, those that 
stand in an obvious (or hypothesized) rela- 
tionship to some element of an animal's nor- 
mal behavior repertoire. This strategy en- 
hances the probability of examining odor for 
which an integrated behavior pattern exists 
that, in turn, increases the probability of 
discovering the circumstances of that odor's 
use and the adaptive advantages which there- 


fore accrue. This aproach can yield a better 
understanding of the many Ways in which 
odors are used by animals for coping with 
the problems of life, 


A second constraint en 


countered by investi- 
gators who seek in | Mie 


Sight into the role pl 
fe of rodents is that 
ay not be a viable 
toire as a result of 
ions in the laborator 
and Calhoun ( 


ayed 

the 
part of 
restric- 
Y. Both 
1962a, 19625 
an develop a 
interaction. ai 
their genetic in. 


behavior sought m 
the animal's reper 
tive rearing condit 
Barnett (1963) 
vividly illustrate that rats c 
sophisticated social life with 

many levels, and apparently 
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heritance provides them with the predisposi- 
tions for such development. Yet, standard 
laboratory rearing and housing башыды 
may well deprive animals of the opportunity 
to develop these latent abilities. А case PC 
already been made for doing research pe: 
biologically meaningful odors. A strategy а 
would further optimize payoffs would be 
use subjects that have had adequate chane 
to develop a full range of innate behavior. 

Despite these constraints, research on us 
use of olfaction by rodents to regulate thet 
behaviors has had. considerable study. In ш 
search for generalizations the studies of p 
taneous preferences for odors off the d 
appear at this point to have been fruitless 2 a 
they do not appear to lead in any aped 
direction. On the other hand, there have 2 
several interesting findings concerning ae 
odor by rodents, Rodents appear to be "c 
to give themselves olfactory cues about p 
environment and perhaps via these cues ent 
communicate the state of the eid 
to other animals. There is an odor compl 
in trail marking, in spontaneous alter p 
reactions to stress and frustration, avio. 
dominance and cohesion, and sexual peha and 
These are exciting findings in themselves, Я x 
it would be of considerable interest diffe? 
amine the functional similarities and range? 
ences between these odors. The к not 
of these phenomena are not known & 
understood. strati? 

The fact that some of these demon? a find’ 
have arisen from accidental experiment, and 
ings in the laboratory is of far-reac i “capt 
immediate significance, The olfactoT*, , jg 
bility of laboratory animals si o D 
nored. Indeed, it appears that muc, ae 
world of rodents is viewed «ri a" 
noses. That world appears to be ' 
complex beyond our imagination. 


he 
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MEASURING 


THRESHOLDS: 
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University of Reading, England 


M. Treisman and T. R. Watts argued in 1966 that signal detectability measures 


could be related to those given by the traditional psychophy 


1 procedures, and 


they presented evidence supporting this. P. D. Isaac has since criticized their method 


of analysis. The data are reexamined in 
original conclusion is strengthened, and 


Only if the independent variable has been 
measured with little or no error are we justitied 
m fitting a regression line to joint observations 
(ху) of variables whose true values are V; 
and Y. If error terms affect both ху and yi 
and we assume the underlying random vari- 
ables to be related by F = a. + 8X, the rela- 
tion between the observations is given һу: 


[1] 


А his, which is known as a structural. rela- 
01, is the appropriate model to fit, but al- 
Tough estimates for the parameters have been 
M Since 1879 (Madansky, 1939), the struc- 
Dsycl relation has been largely ignored by 
nl hologists, partly for reasons of unfamil- 


larity z $ « ө. 
п "LY. A valuable contribution to dispelling 
US re 


M =a. + 8, (x, = €) рр 


illustrated it by showing that Treisman 
Vatts (1966), who fitted a regression to 
&ррго data, should have determined more 
Sttond ee a structural relation. Since this 
sults ШО inevitably must cast doubt ой 
heoreti Па һауе been thought to establish а 
1970 0 palet of some interest (D Amato 
Carefy E implications of this criticism require 
ul assessment. 
'S sometimes taken that signal detect- 
A theory (SDT) procedures have super- 


ti 


t 
ШИ 
1* 5 ——- = 
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sity 
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light of this criticism. It is found that the 
a further implication of the data is noted. 


seded those of classical psychophysics. Treis- 
man and Watts (1966) argued that the same 
statistical decision mechanism, operating on 
normal distributions of stimulus effects, could 
underlie both types of task ; if so, exact quanti- 
tative relations should exist between measures 
given by classical and SDT procedures. 
Accordingly, they derived and tested a predic- 
tion relating the method of constant stimuli 
and the SD'T rating method. Two parameters 
are usually derived from the latter, d^ and 
csv/ow, Where S = signal and N = noise. 
They showed that aa, the standard deviation 
of the method of constant stimuli psychometric 
function, corresponds to osy, and that a sta- 
tistic derived from the method of constant 
stimuli, d/o = (/sv — /y)/oar, is related to 
d' by: 


dt = (ox/osx)d'. [2] 


This predicts a slope somewhat less than 
or equal to 1, since usually esy/ew > 1. 
If we define a new SDT measure, d" = 
(Max = My)/osy = (сх/сѕх)0', we can write 
this prediction more simply as: 


[3] 

This form is preferable, since it predicts the 
same slope for each subject, whereas esx/ex 
will vary among subjects. 

Using an auditory intensity-discrimination 
task, Treisman and Watts determined both 
d'aa and d’ for 26 experimental sessions, the 
method of constant stimuli and the rating 
method being interleaved. They calculated the 


regression as: 


dea = ITE Til’. [4] 
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The slope did not differ significantly from 
.67, the reciprocal of the mean osy/oy, nor 
from 1, but was significantly greater than 0. 
'This appeared to confirm the prediction in 
Equation 2. Р | 

Issac (1970) correctly noted that Equation 
2 describes a structural relation, both variables 
being measured with error, and therefore 
fitting such a relation would be most appro- 
priate. Nevertheless, two points mav be made 
in favor of the procedure followed. First, to 
estimate 8, requires prior knowledge of Fe 
or o*.,, the error variances, or А = g*, /g*,, “a 
situation not too likely to arise in the biological 
sciences [Sokal & Rohlf, 1969, p. 483]" and 
which did not appear to arise here. Isaac (1970) 
suggested that A can be estimated from the 
results of four subjects who repeated sessions, 
but this is doubtful: Each shows a positive 
relation between d'u and d’ over sessions. If, 
then, we fall back on regression analysis, we 
may note that it appears to give a conservative 
test of the null hypothesis (taking this to be 
the zero slope implied by a random relation) 
since the regression slope й, = 5.) S°; sets а 
lower bound for ĝ,, which may be estimated by 
B, = Szy/(s*. — o"). 

But though the regression slope may 
service in rejecting the null hypothesis, it pro 
vides limited evidence on the form of the linear 
interrelation between the underlying vari 
(Isaac, 1970); therefore, a new an 
data was devised that al 
Each session was diy 
d'a, d', and d 


be of 


ables 


alysis of the 
lows to be estimated. 
ided into equal halves, and 
derived for each half. (One 
session for which this could not be done 
dropped.) The intrasession vari 
as error variance a 
9*,, апа А, and t 
determined sep: 
first gave: 


was 
ation was taken 
ind was used to estimate Pey 
he structural relation then was 
arately for d’ and for d”. The 


Hot 92 4- Лү — ea) 4+ eye [5] 
from 
from .67 or 1. 
that in Equation 4, which 
» Imply that B, w 
closely ; 
atively low 
by the estimate for 


est’ 
The slope is significantly different 
0 ( = 4.47, P< 001) but not 
It is Very similar to th 
does Dot, of course 
approximate В, 
the rel 


ill always 
this is due here to 
z : tae 

d error variance indicated 


= 2 / ә B 
\ = с ttres Tegs Which was 
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7.49. The formula used for / was: 


AO 


Sr 


l= | (n—2) sin? 10-0], 
[6] 


where б = tan ! d and Û = tan ! dy. (There 18 
an error in the corresponding equation (Equa- 
tion 29.62) in Kendall and Stuart [1961], eng 
in Isaac’s transcription of it.) As А was cal- 
culated from the data, we took df = n — > 
The analysis for d” gave: 


d'a = 93 4-990" — ee) + ea Ul 


with А = 3 The slope differs significantly 
from 0 (0 = 5.02, p < .001) but not from a 
Both slopes are in good agreement with ma 
predictions, and they differ from one anoth 
in the expected direction. de 
It seems that the estimated structural me 
tions fully support the original theory. 2 wal 
ever, two findings deserve further comme 


M 3 е ле | » error 
First, the high А values indicate that ihe nid 
variance for d^a is considerably greater the 


that for the SDT measures. Second, are 
additive constants in Equations 5 and 252 
unexpectedly large, indicating that d'et 00 

to be higher than d’ or d". 


1! 
" av accou! 
A suggestion can be made that may ac 


е а тусе ds СОЙ" 

for both findings. In applying SDT 1t 15 MIT 
ignore possible variation in setting 

mon to ignore possible variation in set js no 


criterion since this extra variance, 020 iance 
He маг 

normally separable from the noise î rio 
ау. Nevertheless, it is likely that some ¢ 

variance ds. always present. In the 

method, subjects had to maintain thr 

. е Й oe " 

arate criteria, and we suggest that gu 

present at a constant and effectively irre add 

level tl I sessi This wou“ i". 

Toughout the session. MEE 

to the noise variance, thus lowering “ 


га! ing 


seio 


. . . А -5€ 

with little difference between the half js 7 onl) 
In the method of constant stimuli, p a sub” 
one criterion to maintain, we suggest U^. ime 


H 
s A ods ol 
Jects were able to reduce o, for periods would 
during the session. Thus g*y and o8 aan! 


E ¢ con9 gy 
tend to be less in the method of © y 


ing., 
E ! | аЛ 2 
stimuli than in the rating method, ^ a) 1 
relatively higher d'ea values. When A ^ du 
particularly low, this sometimes mus ceri? 


to ап especially | 


cries ul 
arge component of ing 
Varlance, 


2 „respo?“ 
making possible a corres?” sta 


ops 
тени e. : co 
large reduction in a, in the method of 
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stimuli with an equivalent rise in d's. This 
would tend to produce a positive intercept in 
Equations 5 and 7. 

Furthermore, if the reductions in 02, in the 
method of constant stimuli occur. intermit- 
tently during a session, this will cause the intra- 
Session variability in d'a to be greater than 
that for the SDT measures as shown by the 
^ values, each of which considerably exceeds 1 
with р < .001. Since falls in e*. would be more 
likely early in a session, subjects relaxing the 
extra effort as they get bored or fatigued, we 
might also expect that d'est „but not the SDT 
Measures, would decline during the session. 
Though not reaching significance, the evidence 
is in this direction. Means for the two half- 
Sessions were, for d’, 2.07, 2.00 (t = 0.28) ; for 
d”, 1.68, 1.52 (t= .71); and for d'e, 2.82, 
2.22 (( = 1.50; p < .1, onetailed). 
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IMPRINTING IN PRECOCIAL BIRDS: 
INTERPRETATION, EVIDENCE, AND EVALUATION ? 


D. W. КАЈЕСКІ? 


University of Michigan 


The identification of the basis for the phenomenon known as imprinting in 
young precocial birds is undertaken from six viewpoints. These are reviewed 
and examined in light of the experimental evidence concerning three aspects of 
imprinting: an early, nondiscriminative period of sensitivity to a wide range 
of parental surrogates, the formation of filial ties with a limited set of those 


surrogates, and the expression of filial 


bonds in subsequent situations. The 


concept of arousal emerges as a salient factor in virtually all interpretations of 
imprinting, but there is presently a divergence of opinion concerning the role 


that arousal plays. 


The experimental investigation of imprint- 
ing in precocial birds has revealed several re- 
lated patterns or aspects of the development 
of filial bonds in the young. These include 
(a) the tendency of newly hatched, naive 
birds to indiscriminantly approach a parent 
or parental surrogate, hereafter referred to 
as the target (this tendency rapidly declines 
with age); (b) the tendency of birds to ex- 
hibit increasing filial behavior toward a target 
presented repeatedly in a single exposure 
session or over more than one exposure ses- 
sions; and (c) the tendency of birds to 
exhibit subsequent filial behavior toward a 
previously exposed target when presented 
simultaneously with a novel target. The re- 
ports of the discovery of these elements have 
been ably reviewed by Bateson (1966) and 
Sluckin (1965). 

These aspects find their counterpart in 
Evans’ (1970) portrayal of the early and 
successive stages in the development of mo- 
bility and recognition in Ring-billed gulls 
(Larus delawarensis). Evans based his report 
оп a large-scale study of imprinting in the 
semiprecocial gulls that included naturalistic 


observation, field experiments, and laboratory 
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experiments. Over the first few days after 
hatching, three developmental stages emerge 
in terms of motor ability: (a) incipient 
(chicks remaining in or on the edge of ш 
nest through the first day), (0) restricted 
(the onset of excursions limited to the vicinity 
of the nest site through the second and {рге 
days), and (c) extended (permanent exi 
tions, typically led by the parents, from t Я 
nest site to other areas from the fourth day 
on). Table 1 reproduces Evans’ develop- 
mental scheme. It can be seen that the 
temporally related entries in the table unde! 
the columns headed by “Recognition” = 
“Approach responses” are akin to the aspect 


TABLE 1 


RANGE or 
APPROACH RESPON 
YOUNG RING-BILLED GUL 


es of 
developmental stages 


Developmental 
stage 


| Stimulus-response correlat 


| 


Recognition 


піса 
respond to a wide| stimuli 
range of stimuli m 
Я : їешаг €^ 
П: Restricted | Species recognition | Toa particip 
mobility | of famili 

stimuli 

i los. area freee 

Individual recog- | To stimuli 255 
nition ated W 


шаг fam’ ys 
individua! 
| i 
Е {то ү 
5 Я isi io 
NE Revised version reprinted with per mission patio 
monograph by R. M. ишнен о ошау gea 
Monographs, 1970, Voj, opyrighted hy Anima 
Monographs, 1970; É | 


Em 


IMPRINTING 49 


of imprinting in precocial birds in the current 
usage. The subsequent evaluations of the 
Interpretations of imprinting will be under- 
taken with two standards in mind: first, how 
Well the interpretation accounts for the re- 
lated elements of the phenomenon and second, 
how well the evidence supports the inter- 
pretation. 


CONTEXTUAL INTERPRETATION 
oF IMPRINTING 


aie (1966) cast an especially critical 
uni - „the proposition that imprinting is 
rw + es that it differs in kind or even in 
be е rom other types of learning. It should 
ing uM that for Bateson the term imprint- 
EN. a rather restricted usage: It is the 
ог кел of the characteristics of the parent 
basi rogate. Concluding that there is little 
tsis for the claim of uniqueness (see also 
^ vein. 1970), Bateson did recognize that the 
Pip M typically subsumed under the 
its a of imprinting does have a “flavour of 
out Wn which needs to be accounted for with- 
e Li mystical [p. 204].” Bateson sug- 
de | that rather than making claims for 
ami a sae it might be more fruitful to ex- 
ing o the unique situation in which imprint- 
Ccurs, 
mat first place, the newly hatched ani- 
Derine, a history of extremely limited ex- 
nce and usually encounters a single tar- 
eni m ап otherwise impoverished setting. 
аге EE the preferences acquired early in life 
e Ned rapidly and have stability be- 
Com of the absence of other acquired or 
Peting preferences. But preference implies 
operate petitive or motivational factors are 
attributes The basis for this motivation 15 
is "a ed to systems that have been estab- 
Drintine a that are operating prior to im- 
One 8 (learning). Bateson simply named 
filia Possible system as "filial behavior." This 
that « ehavior is manifest in the observation 
Classify, rather than passively filtering and 
аенуы 18 their (sensory) input, the birds 
ШУ: search for a certain class of stimuli. 
dite 8 located a member of that class, they 
begin their responses toward it and then 
Th die learn its characteristics [p. 208]." 
hay, it Words, “, . . before their preferences 
een restricted by imprinting the birds 


search for a broad range of conspicuous 
stimuli, On this view the consummatory stim- 
uli that terminate this activity initiate im- 
printing and may be regarded as reinforcing. 
Subsequently the imprinting process restricts 
the range of consummatory stimuli, with the 
result that the birds only search for the ob- 
ject whose characteristics they have learned 
[p. 209]." ` 


Contextual Interpretation Examined 


One question to be asked of the interpreta- 
tion is whether this pattern of events, the 
learning of the parental characteristics, is 
reconcilable with an instrumental or classical 
conditioning paradigm? Clearly, it is not an 
instrumental paradigm because the response, 
approach and following, does not require 
shaping and is emitted prior to conditioning. 
Is it then a classical conditioning paradigm? 
An unconditioned response is indeed present, 
approach and following, and this can become 
the conditioned response. An unconditioned 
stimulus is also present, the conspicuous stim- 
ulus, for this stimulus can automatically elicit 
the response on first presentation and hence 
reinforce the conditioned response to the con- 
ditioned stimulus. But if the unconditioned 
stimulus is the conspicuous stimulus, what is 
the conditioned stimulus? The inconspicuous 
elements of the stimulus in question? It seems 
unlikely. 

If it is difficult to assign imprinting to one 
or another category of learning, it should at 
least be possible to predict consequences for 
performance if the reinforcement is removed. 
According to Bateson, the stimulus that ter- 
minates searching is the reinforcing event in 
the “learning” of the self same stimulus. It 
would seem that the removal of this reinforc- 
ing event should result in extinction. But 
extinction of what? Of the response? No, the 
response persists; the animal continues to 
search. Extinction of the retention of the 
stimulus characteristics? No, the animal will 
prefer the familiar to the novel in a subse- 
quent choice. 

In fairness it should be pointed out that 
e structural change underlies 
the acquisition of the characteristics of the 
target by the imprinted bird. Bateson's later 
work shows that protein and RNA synthesis 


apparently som 
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in the chick brain is affected by an imprint- 
ing procedure (Bateson, Horn, & Rose, 1969; 
Horn, Horn, Bateson, & Rose, 1971; Rose, 
Bateson, Horn, & Horn, 1970). It must also 
be pointed out that it is difficult to reconcile 
such acquisition in an imprinting situation 
with the conventional use of the global terms 
of stimulus, response, and reinforcement. 


ETHOLOGICAL INTERPRETATION 
OF IMPRINTING 


Although agreement is not universal among 
them, it has been proposed by ethologists (cf. 
Hess, 1970; Thorpe, 1956) that imprinting 
is one of any number of behaviors that have 
a fundamentally genetic basis. Such behav- 
ioral patterns are characteristic not of the 
individual but of the species to which the 
animal belongs and the origins of the be- 
haviors are not determined or modified by 
experience, although the expression of the be- 
haviors may come under environmental con- 
trol. These innate behaviors are comprised 
of reflexes, taxes, and fixed-action patterns. 

What triggers these innate behaviors? Not 
all perceived stimuli elicit a particular re- 
sponse. Rather, there is a certain limited 
range of stimuli termed “sign stimuli” or 
“releasers.”” Corresponding to the releasers 
are innate releasing mechanisms which re- 
spond to the external stimulation by provid- 
ing energy for the performance of (or un- 
blocking) the appropriate fixed-action pat- 
tern. 

Thus imprinting in an ethological view 
process that occurs in th 
pression of fixed-action patterns (approach 
and following) as released by certain sign 
stimuli (targets) through the operation of 
innate releasing mechanisms, 


isa 
€ course of the ex- 


Ethological Interpretation Examined 
The etholo 
reflexes, taxes, and fixed- 


large assortment. of objects and events will 
evoke the response. These include, in the 
visual realm alone: humans, balloons, electric 
trains (Fabricius, 1951, 1955; Fabricius & 
Boyd, 1954), footballs (Ramsay, 1951), 
small cubes and cylinders (Jaynes, 1956, 
1957), colored cardboard boxes (Salzen & 
Sluckin, 1959), light flicker (James, bu 
1960), moving horizontal lines (Smith 6 
Hoyes, 1961), rotating discs (Smith, 1960); 
animals of a different species (Baer & Gray; 
1960), and stationary three-dimensional ob- 
jects (Gray, 1960), among others. hat 

It is possible to argue, one supposes, ie 
all of the above stimuli somehow happen P 
fall within the restriction of a genetically 
determined range. This supposition is weak- 
ened considerably by a report by Melvin 
Cloar, and Massingill (1967). They а 
that 7 of 18 Bobwhite quail chicks ie 
on a live sparrow hawk. In fact, one of i 
chicks literally attacked and seized e^ en 
predator subsequently showed a strong vn 
dency to approach the hawk. Mae aje 
responses of young birds may be more lab à 
than the ethologists would lead us to im 
In a tantalizing report on a single subj. 
Klopfer (1956) tells of a domestic m 
(Gallus gallus) that not only imprinted ол : 
cagemate, a swan goose (Anser cygnoides? 
gosling, but went on to adopt certain feedi a 
habits and vocalization patterns of its. pee 

In sum, the examination of imprinting | 
a formidable array of stimuli has rm 
little on which to choose among them prine 
stimuli. This is not to say that all stimuli is 
equally effective (see Gottlieb, 1971)- : 
only to say that the phenomenon вй 
understood merely in terms of the natu 
the stimuli that elicit it. 


CONDITIONING INTERPRETATION 
OF IMPRINTING 


k 
Moltz (1960) observed that Peking w jp 
lings, first introduced to an imprinting ел 
paratus containing a moving target b ity 
8-10 hours of age, showed no emotion it 
and attended to the target only wh ye 
Passed close by. However, when the n thé 
animal was returned to the same situatio rful 
next day, it was quite likely to become 1€ (nd 
and to begin shortly to follow the m? 


> 
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ои when in proximity to the target, 
startle pa ceased making distress calls and 
Molt responses. From these observations 
z made the following deductions: 
s и the initial exposure, some їгас- 
кыы Ше Animal's low-anxiety state be- 
mes conditioned to previously novel stim- 
Uli: the target, ` 
"d Eu Hess of the second exposure, 
xS E ri emotionality and excitement 
jar abo. having been placed in an unfamil- 
ation. This unfamiliarity is based in 
insi the animal's observations of certain 
mem of the runway not previously 
m RM is a result of its increased 
familia: Mity) and in part by having become 
ably th with some other situation, presum- 
„У the rearing area. 
ban that the target has been previously 
fonts. ed with E low-anxiety state, it can 
sion E. a reinforcer on the second occa- 
instru nd mediate new learning. A response 
mental in bringing the animal closer to 
hal should result in anxiety reduction, 
appro = animal should learn the response of 
ds iin hing the target and remaining near it 
ош а5 the anxiety-producing situation 
a ack to exist and the target continues 
Ction as a reinforcer. 


Conditia: А 
nditioning Interpretation Examined 


i Conditioning interpretation holds that 
When tractiveness of any imprinting target, 
hits the animal is in a state of stress, re- 
ar 5 from a previous association between the 

Ret and a certain visceral and cardiac state 
to I animal. This certain state is assumed 
Chara a relatively positive one in that it is 
ampie, ee by Moltz by the term “low 

at s To the contrary, it has been found 
анас ative to controls, birds show stronger 
akes E when first exposure to a target 
tions TM under presumably noxious condi- 
Drinti itz and Ross (1961) found that im- 
exXposee was enhanced when subjects were 
Obser ed to a loud noise when in a position ^s 
birds "e the target. Similarly, EUM 
“ес that received ап optimal a 
Stron € shocks on the initial trial showed 
t Ber following responses when compared 
Nonshocked controls (Kovach & Hess, 


19 
63). In a like finding, Salzen (1970) re- 


ported that chicks are particularly responsive 
to a target consisting of a flat board after 
they have been struck sharply by the board. 
Furthermore, Klopfer and Hailman (1964) 
reported that ducklings exposed to a 10- 
minute cold shock (40* Fahrenheit) immedi- 
ately after initial exposure to an imprinting 
stimulus followed that stimulus 24 hours later 
as vigorously as nonchilled birds. 

Although none of the studies cited report 
an objective measure of the noxious properties 
of the situations involved, similar levels are 
effective in avoidance conditioning in other 
species. These data, then, are not in support 
of the conditioning hypothesis advanced by 


Moltz (1960). 


EPIGENETIC INTERPRETATION OF IMPRINTING 


More recently, Moltz (1963, 1968) made 
the casual observation that it was quite likely 
that a Peking duckling would approach a 
moving target during initial exposure to it. 
Additionally, it seemed that the birds evi- 
denced positive responses only when the 
target retreated from them, not when it ap- 
proached. In order to account for this initial 
approach movement, Moltz appealed to an 
hypothesis advanced by Schneirla (1959) 
that in early stages in behavioral ontogeny, 
stimulus intensity rather than stimulus qual- 
ity is a critical factor. Schneirla (1959, 
1965) argued that early in development cer- 
tain low or decreasing levels of stimulation 
automatically elicited approach responses 
while intense levels elicited withdrawal re- 
sponses. 

Moltz applied this formulation to the be- 
havior of his subjects and concluded that, in 
terms of the maturational level of the duck- 
lings, the retreating target provided the ap- 
propriate level of stimulation to elicit ap- 
proach responses. The animals then followed 
the target as it continued to retreat in order 
to optimize this level of stimulation. А com- 
plementary observation was that if the target 
had occasion to move toward the subject, it 
did not elicit approach and sometimes elicited 
withdrawal responses. 

Why, then, should the same target continue 
io elicit following on the second day? Pre- 
sumably, the maturational state of the animal 
would have been radically altered while the 
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stimulus value of the target remained con- 
stant. Moltz answered this question by again 
invoking the possibility that on the first. ex- 
posure the qualitative characteristics of the 
target may very well become conditioned to 
the visceral states associated with approach 
processes; and attachment on subsequent tests 
is maintained by selective learning with fear 
reduction serving as the reinforcement. 

Kovach (1970) considered the epigenetic 
position in a somewhat different light. He 
contrasted two possible explanations for the 
apparent increase, then decrease, in respon- 
siveness of chicks to a novel target with in- 
creasing age: (a) The critical period analysis 
that holds that the maturational emergence 
of processes that support or compete with the 
approach response account for the behavioral 
disparities over age; (5) the epigenetic analy- 
sis that holds that these differences are func- 
tionally related to the animal's optimal 
arousal in terms of the animal's own sensi- 
tivity and the actual energy changes in the 
environment. In brief, early in life relatively 
high stimulus intensity should evoke ap- 
proach, while later, lesser levels of intensity 
will be sufficient to evoke the same behavior. 
Thus if a target of constant intensity is pre- 
sented to chicks of different ages, it should 
elicit most approach from the birds for which 
it produced an optimal level of arousal. 


Epigenetic Interpretation Examined 


This interpretation maintains that different 
quantitative levels of stimulation are optimal 
in eliciting approach behavior at various 
stages of the bird's maturation. Kovach 
(1970) put this Proposition to a direct test 
by examining the influence on approach be- 
havior of а number of factors including age, 
of the rate, intensity, 
ain visual stimuli, and 


of chicks of variou 
7 days) to a light flickerin 
(from .8 to 8.9 cycles pe 
form of an inverted U, 

mediate frequencies were 
effective regardless of the 
Further, when responsiven 
terms of age with al] freq 


8 at different rates 
T second) took the 
That is, the inter- 
generally the most 
age of the animal. 
€ss was plotted in 
juencies combined, 


the inverted U was again observed. This re- 
lationship was also observed as a function of 
variations in luminance and complexity. Ko- 
vach concluded that the results argued more 
strongly for a critical period rather than е 
epigenetic interpretation of early por 
behavior. He added, however, that some О 
the evidence could be best understood through 
an epigenetic analysis and that it was prob- 
able that elements represented by both for- 
mulations were influencing the animals' be- 
havior. : 

Further, support is lacking for ete. 
(1963) position that retreating targets ane 
provide the amount of stimulation pese 
to elicit approach responses. Moltz himse 
systematically tested the effect of the eina 
ment of the target and reported that while 
à retreating target was most effective, à oi 
get that only approached, or one that er 
nately approach and retreated, could vp 
elicit approach responses on the part of 
subject (Moltz, 1963). Salzen (1970) s 
Schulman, Hale, and Graves (1970) have г 
ported like findings. 


-— N 
INFORMATION-PROCESSING INTERPRETATION 
OF IMPRINTING 


Matthews and Hemmings (1963) under 
took to provide an analysis of four “issues | 
emerging from the research on imprintinh. 
(a) Why do chicks rapidly select particular 
objects in the environment? (b) Why do ү 
follow these? (c) Why does this follow! is 
last for only a limited period? (d) WES 
there a correlation between following behav 
and low anxiety cheeps and the cessation, 
following and overt signs of increased anxiety a 

With regard to Issue a they begin with ter- 
premise that novelty is one of the charac 
istics that draws an organism’s attention re 
a stimulus. Presuming that all stimuli t 
novel to newly hatched birds, they argue ig 
some stimuli are more novel than othe i 
terms of the number of variable dimen at 
they possess. Therefore, the greater the of 
plexity, the greater the attraction. In li€ а 
the term complexity, however, Matthews the 
Hemmings chose to label this propert 
"information content" of the stimulation the 

The basis for this attraction lies ™ 


ai 
potential for the arousal of the bird. Assu” 


15 
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that the emergent bird is in a state of low 
arousal and low body metabolism, the ani- 
mal’s perception of the imprinting target 
Would raise the arousal to a more satisfactory 
point and would increase ambulatory activity. 
Continuing to Issue b, should the target re- 
treat or disappear, arousal level would be 
reduced to a less satisfactory point, and the 
animal would follow or search for the target 
to maximize or reestablish the input. 
hoe following of a particular target would 
i Sa cease (Issue c) when it had lost its 
ap Pom sustain a satisfactory level of 
les ; p other words, when it had become 
ts Men el. This could come about, according 
the - thews and Hemmings, if one considers 
P un ы system as a limited capacity 
this ея channel. А consequence of 
on would be a recoding of the 
order m. stimulation (or information) in 
nie load on the channel. The 
fon a. ls that “the efficiency of this stimula- 
tom the input begins to be reduced as 
e Te phem] recoding process continues 
tna puse in the efficiency of the rein- 
Ces | : hould be caused by the recoding pro- 
amilia, oi Thus the animal follows the 
more = ie less as it begins to seek out 
animal € stimulation. As for Issue d the 
target Blves evidence of anxiety when the 
adequate the environment fails to provide 
€ levels of stimulation. 


Infor 
E 2 mation-Processing Interpretation 
amined 


impii Postulated that the attraction of an 
Certain ing target for a subject will trace a 
initial] temporal pattern. The target will 
a Пор уа approach and following; after 
and th. Owever, this attraction will diminish 
the Pri Subject will search for something that 

"y Bet lacks, 

a "i Dattern was exemplified in the findings 
Pekin Udy by Moltz and Rosenblum (1958a). 
target f ducklings were exposed to a moving 
tive d 9r 25 minutes on each of 15 consecu- 
Showed? 5. For the first 5 days the subjects 
time С striking increases in the amount of 

Pent following the target. Their re- 
Sho “eS over the remaining days, however, 
Ing 9 ап almost equal decline. Now, appeal- 
their hypothesis, Matthews and Hem- 


mings (1963) would claim that up to Day 5 
the target had provided sufficient novelty 
(information) to evoke satisfactory levels of 
arousal and hence following, but that after 
Day 5 the target had begun to lose this ca- 
pacity and the ducklings followed it less. It 
remains, though, to examine a larger body 
of evidence before concluding that a target 
is effective only when i£ is the source of 
arousal. 

For example, Moltz and Rosenblum 
(1958b) reported a study similar to the one 
in which they found a decline in following 
over later training days. In the more recent 
work they applied electric shock to some 
ducklings prior to the last 4 of 10 sessions. 
These showed superior following relative to 
nonshocked controls. Also, ducklings given 
an opportunity to adapt to the runway 
showed less responsiveness to the target com- 
pared to ducklings that were not adapted 
(Moltz, Rosenblum, & Halikas, 1959), and 
ducklings shocked in the runway (in the ab- 
sence of the target) followed more than those 
shocked outside the runway (Moltz, 1963). 
Corollary findings are that the introduction 
of a strange object into the test area will re- 
store the following behavior of birds that have 
ceased to follow the imprinting target (Cand- 
land, Nagy, & Conklyn, 1963; Sluckin & 
Salzen, 1961) and that the strength of follow- 
ing on second exposure is positively related 
to the length of time elapsed since the first 
exposure (Sluckin & Taylor, 1964). 

In general, these findings indicate that 
arousal produced independently of the target 
enhances responsiveness toward a familiar ob- 
ject. What of the role of arousal in naive birds 
in the presence of a novel target? Adrenergic 
stimulants such as ephedrine and epinephrine 
(Kovach, 1964) as well as amphetamines 
(Kovach, 1964; Rajecki & Saegert, 1971) 
have been found to increase the likelihood and 
the strength of imprinting. Complementary 
results accrue from research wherein arousal 
is reduced via drugs. Hess (1959b) found 
that tranquilizers such as meprobamate and 
sodium pentobarbital reduce the incidence of 
approach behavior. Similarly, Bradford and 
Macdonald (1969) demonstrated that sodium 
pentobarbital, administered prior to training 
sessions, interfered with improvement of im- 
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printing over trials in proportion to the 
strength of the dosage. It is important to те- 
port here that in subsequent tests both Ko- 
vach (1964) and Bradford and Macdonald 
(1969) discovered that a given degree of im- 
printing was not drug-state dependent; that 
is, differences observed under different drug 
conditions persisted in the absence of the 
drug at a later time. 

It seems clear that arousal is a major in- 
fluence on the elicitation and persistence of 
this social behavior. In light of the evidence 
presented, however, it is not at all clear that 
the target is the source of arousal as in the 
view of Matthews and Hemmings (1963). 
Although the evidence remains indirect, some 
writers (Hoffman, Stratton, Newby, & Bar- 
rett, 1970) maintain that even the initial pre- 
sentation of an imprinting object has as a con- 
sequence the reduction of arousal as indicated 
by the cessation of distress calls in ducklings. 


NEURONAL MODEL INTERPRETATION 
OF IMPRINTING 


Imprinting for Salzen (1962, 1970) is the 
fixation of a class of objects to which social 
responses, both present and future, are di- 
rected and confined. Salzen accounted for 
the establishment of this fixation by applying 
Sokolov's (1960) neuronal modeling hypothe- 
sis for the orienting reflex in mammals. The 
hypothesis postulates a model-forming system 


in which neuronal assemblies, at the cortical 
level, are set up t 
stimulation. If a s 
lation does not m 


In order to account for 
tendencies, Salzen argued tha 
of the newly hatched chick j 
extremely limited visual a 


perience even though those 


initial approach 
t a characteristic 
s that it has had 
nd auditory ex. 
systems are quite 


functional. This suggests to Salzen that any 
source of stimulation that meets a few simple 
requirements of the visual-auditory syen 
will instigate orientation and fixation or tha 
the distance-receptors neuronal model S 
tems are sufficiently unstructured so that апу 
focal source of stimulation will give only d 
mild degree of mismatch and thus give mild 
activation in the form of orientation, fixation, 
and approach. 

Then, as the length of exposure 
stimulation (target) increases, a correspo А 
ing refinement of the model results, and e 
repeated exposures lead to stronger M€ F 
of orientation (approach and following; 
However, a chick first exposed to a target id 
a relatively advanced age will initially € 
it discrepant with whatever distance-recep, 
models (perhaps of the rearing environmen 
that exist and therefore orient away de 
the unfamiliar. Thus the formation of : е 
ronal models governs both the formatio! 
attachments and the development of 
avoidance of the novel. 


to the 
nd- 


: z»amined 
Neuronal Model Interpretation Examine | 
jon © 


В E " at 
A closer examination of the formula oloV: 


the neuronal model hypothesis e (re 
1960) and the evidence that bears оп а re^ 
views by Lynn, 1966; van Olst, 197 


satisfactory. In the first place, the a 
of Sokolov's hypothesis lies in its exP а in 
not of the attractiveness of familiar 5 гейе* 
but of the habituation of the orienting содй 
with repeated exposures of a stimulus. s jmt | 
habituation of the orienting reflex to d with 
lus is thought to ultimately result in w co 
an organism to ignore the stimulus s se 
tinued presentations may induce slee ex д 
Lynn, 1966). Third, the orienting ТЄ govê 
rects the organism’s attention towa™.. go 


tanti 1 5 
H + H а 
stimuli, not away. Finally, orienting h; 


Р : ac y 
necessarily correlated with арр ry ma 
Dimond (1970) points out, an 01822 grd 


subsequently flee after orienting 07 
stimulus, pre 4 

It would seem, then, that the inte"? ion 
of imprinting in terms of the for ай yt’ 
neuronal models requires an alter in E 
of assumptions, possibly of the fol, ic? 
(a) At hatching there are absent ? 


IMPRINTING ээ 


ш тюй for incoming stimulation with 
© consequence that the inhibition of the 
OUenting reflex to random stimuli is not 
рне (^) The persistence of orienting re- 
кар 10 the general environment results in 
Th arousal state noxious to the animal. (c) 
aires of a salient stimulus (target) 
reflex "s à rapid habituation of the orienting 
remains о that stimulus. (d) The animal 
ing) to i proximity (approach and follow- 
of habit i target to the degree that the state 
arousal Vor is preferable to the state of 
in the by the unfamiliar elements 
s vironment. 
T һе best evidence for the neuronal model- 
form ole i of imprinting, whatever 
of anew a would be some direct measure 
consistent. To date there is evidence not in- 
dence г : with the hypothesis, but this evi- 
a no ur indirect. In Salzen's terms, the 
distanre of highly structured models for the 
should f receptors in the just hatched, one 
or fear ind very little “orienting away from 
(1970) of any stimulus configuration. Salzen 
functio traced the development of fear as a 
Several i of posthatch visual experience in 
есы ы Pendent groups of chicks. Salzen's 
erred б меге hatched in the dark and trans- 
-12 NS illuminated isolation cages when 
> an ta of age. They were then subjected 
fantem pee looming object (the experi- 
time м hand) immediately or at some later 
Much ‘rom 5, 15, or 60 minutes up to as 
Salzer as 1 or 7 days) after the transfer. 
the "A found virtually no response at all in 
ive gei birds but as he tested over progres- 
cks seater lengths of visual experience, 
alzen Degan to flinch, then flee the object. 
"XDerie Concluded that the chicks with more 
ad nce were more disturbed because they 
Neuronal à greater opportunity to form а 
. al model of the housing environment. 
: Interesting to recount here that Mac- 
Ule ‘ (1968) maintained chicks on a sched- 
firs @ Sodium pentobarbital injections for the 
4 a УЗ after hatching. When tested on Day 
degre > these drug-treated birds showed a 
€ of imprintability far greater than birds 


simi] 
(19 tly treated with saline. Macdonald 


6 2 
8 concluded that the tranquilized birds 


Tem М 
al s Een 
Ned sensitive to the imprinting target 


because the drug prevented them from im- 
printing on the environment. 

An implication to be drawn from the neu- 
ronal model interpretation stems from the 
possibility that early posthatch distress (e.g., 
Collias, 1952) is due to the mismatch be- 
tween contemporary stimulation and the neu- 
ronal models engendered by the pre- and 
perinatal environment. Presumably, if some 
element in the postnatal environment was like 
that of the prenatal, the animal would show 
positive responses toward it. 

Some work on the specific influence of pre- 
natal stimulation on postnatal behavior has 
proved inconclusive (Hess, 1959a; Oppen- 
heim, 1968; Ramsay & Hess, 1954). Other 
investigators, however, have offered evidence 
that there is a direct relationship between 
prenatal experience and postnatal behavior. 
Gottlieb (1966) found that stimulation of 
neonate ducklings that involves the perinatal 
call does not have the effect of sensitizing the 
bird to maternal duck calls in general. Rather, 
it directs the bird's attention to the calls of 
siblings in the absence of conspecific maternal 
calls. 

In a study employing contraspecific pre- 
natal stimulation, Grier, Counter, and Shearer 
(1967) exposed chicken eggs to 1-second 
beeps of a 200-Hertz tone throughout Day 
12-18 of incubation. A second batch of eggs 
was incubated in relative quiet. Within 6 
hours of hatching all birds were tested for 
preference for sources emitting 200- or 2,000- 
Hertz tones, Grier et al. found that the chicks 
that had been exposed previously to the 200- 
Hertz tone chose to approach it and not the 
novel sound while the inexperienced chicks 
were indifferent in choice of approach to one 
or the other speaker. In a similar study deal- 
ing with a different sensory mode, Simner ? 
established that in naive chicks there were 
preferred and nonpreferred rates of intermit- 
tent visual stimulation. However, when chicks 
received perinatal exposure to the nonpre- 
ierred rate, Simner found positively modified 
approach-eliciting properties for that rate. 


2M. L. Simner, Effects of prehatch exposure to 
intermittent visual stimulation on posthatch prefer- 
ences for intermittent visual stimulation in chicks. 
Paper presented at the Eastern Psychological Associ- 
ation Convention, New York. April 1971. 
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Dimond (1968) conducted a study that 
bears on the neuronal model interpretation. 
Half of a batch of incubating eggs were con- 
tinuously exposed to a 60-watt light until 
just prior to hatching. The remaining eggs 
were incubated in darkness. АП subjects were 
then hatched and reared in darkness until 
some time prior to being tested for respon- 
siveness to a moving red balloon at 36 hours 
of age. One-third of the chicks were exposed 
to 15 minutes of posthatch illumination by a 
60-watt bulb immediately before the im- 
printing test. Another third of the subjects re- 
ceived 6 hours of such illumination, and a 
final third received 12 hours of illumination. 
For the chicks receiving the shortest post- 
hatch illumination, the dark-incubated sub- 
jects showed greater imprinting relative to 
the light-incubated group in terms of latency 
of first movement and time spent in proximity 
to the target. It may be possible to argue here 
that, at least in the shortest term posthatch 
illumination condition, the chicks in the group 
receiving prenatal illumination found the vi- 
sual stimuli in the test situation discrepant 
with existing neuronal models for visual in- 
put, and thus initially avoided the target. 
This possibility is supported by the finding 
that for the balance of Dimond’s subjects 
(the 6- and 12-hour posthatch treatment 
groups) absolute responsiveness and the dif- 
ference in imprintability between the ex- 
perimental and control groups (light- and 
dark-incubated) diminished as a function of 
increasing periods of illumination prior to 
testing. 

In sum, Salzen's neuronal model inte 
tation is plausible but requires amendme 
adequately fit the data from imprintin 
Search and the direct investigation 


rpre- 
nt to 
g re- 
of the 
ست‎ 
4 i 
Based on these findings it would appear th 
ena 1 at least prenatal auditory im- 
Printing, is conceivable in а natural setting. This 
y the Teports of Simms (1955) 
Series of calls апа responses be- 
one Ey (a Eregarious, essen- 
ore bird) and her young sti 
E ) M still 
and of Gottlieb (1963a) that a [denn 
ood hen begins to vocalize at a time that 
coincides with the pipping of her eggs. The onse, 
of vocalizations occurs between 20- » 


the actual exodus of th 
сазе of the duck. кешш 


at 


36 hours before 
m the nest in the 
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orienting reflex in other species. The BE 
cal assessment of the position rests Wl 
future work. 


CONCLUSION 


Of the several interpretations considered 
here that do not merely dismiss the notion 
of imprinting, several (the ethological, pe 
ditioning, and epigenetic positions) have к, 
received substantial support while at leas 
one (the information-processing position) E 
pears counter to some of the Mes x gn S 
would appear that the neuronal model P 
pretation of imprinting is at least plaus! © 
when examined in the context of the QA 
of imprinting. The idea that the avian joe id 
is actively influenced by pre- and р 
events is long standing (Kuo, 1932a, 193 id 
1932c, 1932d). Further, the neuronal We 
interpretation has provisions for the exp -— 
tion of the following several phases of Р 
cocial avian social behavior: the early wt 
tive period, the formation of social d 
and the preference for the familiar over 0 
targets. 

The clearest impression that emerge 
literature, though, is the importance 
arousal. In all six viewpoints noted, ar? 
is implicated in one or another o 
doubt, writers have fallen back on this uu. 
cept in the absence of another yeu, an 
cept, reinforcement. However, arousal н in 
imprinting situation. might be measur he 
terms of its physiological балела y no 
curious thing is that, apparently, this 7 nten 
yet been done. As seen, some would il in 
that imprinting involves an incre? clair? 
arousal, while others make the aa can 
for decreased arousal. Before this isst t will 
be decided, researchers on imprint n? ду 
have to examine the course of arousal t? 
out all aspects of imprinting. 
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usal 
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ATTENTION AND HEART RATE: 


A CRITICAL APPRAISAL OF THE HYPOTHESIS 
OF LACEY AND LACEY? 


WILLIAM W. HAHN? 


University of Denver 


The hypothesis formulated by Lacey and Lacey concerning attention and heart 


rate is evaluated with respec 


t to its original and subsequent expressions, its 


testability, and research results offering confirmation or disconfirmation of the 


hypothesis. Heart rate deceleration occurs in a va 


many that require attention to the environment, 
of this reaction in actively facilitating certain kinds of sensory-motor 


is lacking. The major contributions that are needed for the 
are clarification of constructs such as 


mental role 
performance 


hypothesis to retain its heuristic value 


“attention” and greater explicitness of several aspe 
s a dependent measure reflecting mediating 


reliance on heart rate alone a: 
mechanisms and cortical effects. 
mediating mechanisms. 


diet theories have generated as much ex- 
research, n the field of psychophysiological 
Minen the past 10 years as the Lacey 
tween апап concerning the relationship be- 
respor attentional variables and the heart rate 
nse. Since its initial formulation, this 
е! һаз been the subject of considerable 
countle and it has provided a framework for 
depender Studies utilizing heart rate as а 
ime to nt variable. It may be useful at this 
Position т an appraisal of the Lacey 
Curred j because some polarization has ос- 
lon dim, the field along an acceptance-rejec- 
овер зат which may be reduced by a 
Well a examination of the theory itself, as 
are = an appraisal of empirical results that 
Cited in support of the hypothesis. 


Wuar Dors тнк HYPOTHESIS SAY? 


um ks simplest and most frequently cited 
the а hypothesis states that acceptance of 
task js ‘ronment or attention to an external 
tion. *SSociated with a heart rate decelera- 

t whereas rejection of the environment, 
tention to internal cues (associative pro- 


Sse: 
Ses Я A 
) is associated with heart rate accelera- 


Campos, 
their 
devel- 


a S 
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riety of situations including 
but support for the instru- 


cts of the hypothesis, less 


of other hypothesized 


tion. However, the theory is really much 
more complex than this so that it must be 
examined in some detail before the various 
expressions of it can be appreciated. 

One can observe the development of the 
Lacey hypothesis in their early concern with 
the general problem of measurement of au- 
tonomic responses and the relationships be- 
tween characteristic autonomic patterns and 
motor impulsivity (Lacey, 1959; Lacey & 
Lacey, 1958). The neurophysiological rela- 
tionships which later play a crucial role in 
the theory were first described as the basis 
for “autonomic-cortical coupling" in the 
Lacey and Lacey (1958) paper on motor 
impulsivity. At this time we see a determined 
attempt. to link neurophysiological research 
results with behavioral phenomena while the 
authors courageously admit to trying to 
develop a theory.? 

The first published presentation of the for- 
malized hypothesis appeared in an article deal- 
ing with the application of psychophysiologi- 
cal measures to the study of psychotherapy 
1959). Tn the process of describing 


(Lacey, 
autonomic indicator can show the 


how one 


3 At various times these formulations are referred 
to as theory or hypothesis or hypotheses, and it is 
seldom clear where theory ends and hypotheses 
begin. We have not attempted to clarify these dis- 
tinctions because of our reluctance to make differ- 
entiations that the authors themselves have not 


made. 
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impact of a stimulus while another does ee 
Lacey cited examples of heart rate or s 
conductance failing to reflect the effects о 
a cold pressor test in some individuals. Hence, 
these measures yielded discordant answers to 
the question of the effectiveness of the stimu- 
lus in producing arousal. Subsequently, in a 
discussion of situational stereotypy Davis 
findings (Davis, 1957; Davis & Buchwald, 
1957; Davis, Buchwald, & Frankman, 1955) 
regarding unexpected changes in the direction 
of a response are cited and original data are 
presented on directional fractionation of heart 
rate and galvanic skin responses (GSR). 
After presenting these data, Lacey proceeded 
to offer some speculations about the reasons 
for the observed directional fractionation. A 
brief review of the blood pressure, carotid 
sinus, cortical-inhibiting mechanisms led to 


the first rudimentary expression of the hy- 
pothesis: 


. where cardiac deceleration is the rule, the sub- 
ject is required primarily to note and detect the 
environment. . . . At the other end, where cardiac 
acceleration is the rule, one can at least speculate 
that the opposite of environmental intake is called 
for. In a sense, then, the acceleration or deceleration 
of the heart could be considered to be something 
like ап instrumental act of the organism, 
either to increased ease of environmental in 
to a form of "rejection of the environment 
1959, p. 199]. 


leading 
take or 
” [Lacey, 


Four years later 
lose the status of “ 
to take on added 
pothesis: 


the hypothesis began to 
Speculative notions" and 
stature as a major hy- 


Kagan, Lacey, & Moss, 1963, p. 174]. Е 
In the same 


paper th 7 is i 
oped further: p € hypothesis is devel- 


for sensory-motor impulsivity 4 p-— E31 

€ results to mean that an pos "s 
ductance was excitatory and Me ipe "i 
presumably inhibitory effects of pe 
but had по detectable effect When i "m 
> low, this latter state itself hein A 
2 g excita- 


come the 
heart rate, 
rates were 
tory [ 


A final expression 


of the hypothesis offers 
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a rewording of it which adds at least one e 
dimension and has produced some p. 
in attempts to test the hypothesis. Again, 
Lacey et al.’s own words, 


Я Р t 
The first study tested a hypothesis . . - that be 
rate will decrease . . . when the subject € 
attention to the external environment . . - when ‘a 
subject “wants to accept” environmental jute: ‘i 
The hypothesis also required acceleration d rane 
served when the subject “wants to reject” env 
mental input . . . [pp. 188-189, italics added]. 


This hypothesis obviously offers a ea 
of difficulties for empirical testing. It ks 
seem to require manipulation and wii 
ment of various combinations of accepta ts 
and attention—difficult cognitive e. 
which should be explored separately an ^ 
combination but probably cannot be. ке 
can one pay attention and neither accep 


) : iffere? 
reject? Is inattention the same, OT y. 50 


“over 


tional variable of “want” to reject, are 2 > 
the hypothesis is of dubious testability 4 
current methods. Yet, the hypothesis "x 
ceived a great deal of testing and 2 
amount of evidence has been presente can 
confirmation of the hypothesis! vd pavé 
this occur? For one thing, investigator pe 
“zeroed in” on one or two aspects о dir 
theory, namely, the acceptance- n ihe 
mension and heart rate change; po e 
factors, such as the role of GSR, nh 
largely ignored. Also, it is a flexible its 
sis which can accommodate the rest?” 
variety of experiments, and finally; or 
aspects of the hypothesis were base 
able and replicable research 
numerous investigators have СО ot 
report. have A 
Later explications of the theory jas 
simplified or clarified these issues ampl on | 
have broadened its scope. For edle of 
is suggested that the cardiac 96 шр! 
leads to sensory facilitation; 
Changes produced within a 5119 9 
cycle also produce cortical arousal 
tion. Not only carotid sinus barorect ors 
aortic baroreceptors and possibly ° and 
Contribute to the mediation proce 


als 


t at jl ay 
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duce cortical facilitation or inhibition (Lacey, 
1967). 

In the most recent offering of both theory 
and data, the heart rate deceleratory response 
15 found to be positively correlated with 
the "contingent negative variation" response 
Which Lacey and Lacey (1970), citing the 
Work of W. Grey Walter and his colleagues, 
characterize as a measure of an organism’s 

readiness to respond.” While it is important 

to relate the heart rate mechanisms to other 
bhysiological responses of behavioral signifi- 
cance, it is also regrettable that this latest 
Version of the hypothesis introduces new 
198165 such as vicarious stress and reintro- 
duces old issues such as response intention 
fom е relationship of cognitive style to car- 
L * eceleration, At times, one has the feel- 
» ы Nat the hypothesis grows too quickly and 
io” outward—expanding to encompass 
"W concepts and measures before results 
"Om earlier investigations can be used to 
"lp settle some of the original issues.* 


NEUROPHYSIOLOGICAL EVIDENCE 


A. 1958 Lacey described neurophysiological 
iki Which delineated the pathways from 
ee Sinus stretch receptors to brainstem 
me anisms, The effects of these baroceptor 
;cüanisms on cortical electroencephalogram 
Ы H 
vid were also outlined. This work pro- 
ed the neurophysiological foundation for 
E theory proposed in 1959 in that it demon- 
by ted that blood pressure changes mediated 
roti aroreceptors in the aortic arch and ca- 
m ч Sinus produce alterations in reticular for- 
ation activity and in cortical excitability. 
th ad been shown that artificial distension of 
cha Carotid sinus of the Flaxedilized cat 
anged an activated EEG pattern into one 
Hi prowsiness or sleep (Bonvallet, Dell, & 
5 * el, 1954). At about the same time it was 
— Wn that the slow wave patterns produced 
tF 
оу example, a recent review 
Роос, M&S a number of attempts 
of No that have produced negative 
liy, © studies are carefully constructed and the nega- 
results probably cannot be dismissed on the 
Methodological or technical weaknesses. It 
е € interesting to see the Laceys’ analysis of 
the «Studies and, if necessary, modifications of their 
at points where relevant data does not offer 
mation of it 


article by Elliott 
s to test the hy- 
results. Some 


basis "е 
Would 


by adrenaline or noradrenaline and the corti- 
cal excitation produced by hypotensive drugs 
were abolished after sinoaortic denervation 
(Nakao, Ballim, & Gellhorn, 1956). Dell 
(1957) and Dell, Bonvallet, and Hugelin 
(1954) suggested that the stretch receptors 
of the carotid sinus produce an inhibitory 
effect on central vasomotor tone and adrenal 
secretion by way of Hering’s nerve, and 
demonstrated that sectioning of this nerve 
abolished the inhibitory effect produced by 
an injection of epinephrine. By demonstrating 
this effect in a prebulbar sectioned prepara- 
tion, they concluded that the inhibition oc- 
curred in the bulbar inhibitory reticular for- 
mation. Bonvallet and Allen (1963) provided 
the final link in this chain of events by show- 
ing that interruption of cardiovascular affer- 
ent input (by sectioning of the glossopharyn- 
geal and vagus nerves) to л. tractus solitarius 
(nTS) led to a prolonged EEG activation pat- 
tern following reticular formation stimulation. 
This suggested that the normal inhibitory 
action of the carotid sinus reflex was lost due 
to interruption of the feedback information 
on which this inhibitory influence normally 
depends. 

At this point it would seem reasonable to 
conclude that both tonic and phasic changes 
in blood pressure or pulse pressure, acting on 
carotid sinus baroreceptors, can produce ob- 
servable changes in reticular formation and in 
the cortical electroencephalogram. However, 
it should be noted that these results do not 
bear very directly on the principal concerns 
of Laceys’ developing hypothesis in which 
the acceptance-rejection task dimension and 
changes in Zeart rate level are the variables 
explored. Neurophysiological corroboration of 
Lacey's hypothesis would involve a study 
which manipulates acceptance-rejection in 
human subjects, while obtaining measures of 
skin conductance, heart rate, blood pressure, 
carotid sinus and aortic baroreceptor output, 
inhibitory influence in Hering's nerve, ac- 
tivity at nTS, and the cortical EEG. This 
experiment, for obvious reasons, has not yet 
been performed. The need for chronically 
implanted recording electrodes may require 
the use of infrahuman subjects, and this 
would necessitate considerable sacrifice in 
ability to equate an acceptance-rejection di- 
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mension with that stimulus-organism char- 
acteristic in the human studies. 


PsyCHOPHYSIOLOGICAL EVIDENCE 


There are several principle lines of develop- 
ment of research relating to Lacey's hypothe- 
sis. The Laceys, turned to studies of re- 
action time and heart rate deceleration during 
the foreperiod, while others began to investi- 
gate alternative explanations for the re- 
ported heart rate changes or added new 
dimensions, such as task difficulty and ver- 
balization requirements, to the already com- 
plicated hypothesis. 

In 1963 Obrist reported a replication of 
the major results previously reported by the 
Laceys. The intake-rejection variable was 
manipulated by use of different tasks, while 
skin conductance, heart rate, blood pressure, 
and respiration were measured. The two 
major hypotheses tested (that tasks requiring 
close attention would result in depression of 
cardiovascular activity and that a noxious 
and a conceptual task would result in sympa- 
theticlike changes in the cardiovascular sys- 
tem) were supported by the results. The 
author was evidently impressed with this con- 
firmation of the theory: 


The present experiment confirms the previous work 
by Lacey on cardiovascular differentiation of various 
stimuli and heart rate acceleration to others [Obrist, 
proposed theoretical model. The direction of the 
cardiovascular changes in response to 
lus among subjects is very consistent 
jects demonstrate heart rate d 
stimuli and, in most respects, 

1963, p. 456]. | 


апу one stimu- 
; 28 of 28 sub- 
eceleration to certain 
is consistent with the 


The next presentations of data came in the 
form of three papers delivered at professional 
meetings.” The Lacey and Lacey paper, “Car- 


5 These Papers were 


NC ы : (а) B. Lacey and . Lacey 
bes me and respiratory die Fl of re 
Ja dh e Барт Presented at the annual meetings 
Washi А ely for Psychophysiological Research, 
and T Lacey, Change n ape, Bley 
a - Lacey, ange in cardiac od te 
action time as a ср stud peus көе 
Sented at the ely ie 


4 ; meetings of the Society abd 
physiological Research, Denver, uo ders 
in J. Coquey and J. Lacey, The e 29 1556 


diovascular and Respiratory Correlates of 
Reaction Time," reported three experiments 
which explored the relationships between 
heart rate variability, heart rate levels, and 
reaction time performance. Using a fixed fore- 
period of 4 seconds, they found a significant 
decrease in heart rate just prior to stimulation 
in 74 of 78 subjects and a rise in heart rate 
iollowing stimulus presentation which led to 
characterization of this response as a V-shaped 
response. Over the entire experiment there 
was a correlation of — .27 between heart rate 
deceleration and reaction time, a correlation 
which rose to — .39 if only trials on which the 
subject was inspiring at time of stimulation 
were evaluated. No significant. correlation 
between heart rate level and reaction time 
was found. 

To determine if individual subjects demon- 
strated a significant relationship between re- 
action time and heart rate deceleration the 
authors attempted to partial out the common 
variance due to time factors by using data 
only from the second half of the experiment 
and excluded all subjects who showed a sig- 
nificant correlation between trial number and 
either heart rate or reaction time. Of the 63 
correlations computed, 50 were negative and 
17 of these were significant at p < .05 or les*: 
The authors state that the probability of ob- 
taining this many significant correlations ® 
less than .001. However, knowing that 15 е 
the original 78 subjects showed significar 
correlations between trial number and the?" 
variables, one wonders to what extent К 
remaining 63 correlations were influenced ^ 


pes | à is 
this third factor of time or learning. (iak a 
even though the correlations were insig? 


cant the fact that heart rate and reaction pa 
varied with trial number to some degree 1“, 
have assisted “chance” in producing ve ns. 
out of 63 significant negative correla и 
Removing the subjects who showed signifie c 
correlations between trial number and тепе 
tion time or heart rate does not remow" ons 
influence of this variable from the corre? The 
between reaction time and heart ае. 


Е = == йй? 
1966. An expanded version of the first PaP% ealt? 
included in National Institute of Mental 
Progress Report MH 00623, June 1965, as арр ит” 
Preprint No, 3. The second two papers are " 

rized briefly in Lacey and Lacey, 1970. 


ATT 


Same argument might apply to the analysis 
ES [сеен to demonstrate that heart 
€ level is significantly correlated with re- 
та time for individuals. Furthermore, it 
not be unreasonable to suggest that 
m rate might decrease during the course 
‘hes experiment due to habituation, while 
"ы. on time performance might be expected 
eL PEN due to practice. Such long-term 
115е5 could contribute to the observed re- 
iy iS, when data are sampled from in- 
non 4 bie over the test period. It would 
ate E re к to have looked at the heart 
When y eration and reaction time perfor- 
ЖЕ when trial number was partialed out 
8 covariance techniques. 
hs pi substantiated conclusion to be 
subjects POM the second study is that the 
duting ү? again showed cardiac deceleration 
showed ne foreperiod ; 10 out of 11 subjects 
10, 5 3% significant. deceleration. Of these 
incre; also showed a significant blood pressure 
ase (median for the group = 1 millimeter 
ad No significant group differences were 
his | between either blood pressure or heart 
"ds reaction time performance, while 
Prepon e for individuals showed a marked 
ыд. iiie of negative correlations be- 
i eaction time and cardiac deceleration. 
with ате third study, a reaction time task 
against ial cael was used to guard 
period Т е possibility that. the 4-second fore- 
On an Of the previous studies was capitalizing 
бы inherent cardiac variability response of 
celerati seconds in duration. The cardiac = 
5 е was again observed in а majority 
ns (34 of 42); no evaluation of rela- 
Was A between heart rate and reaction time 
presented, 
ami the Lacey and Lacey paper, we might 
fo, „ “® that there is a slight relationship, 
dec, 87040 of subjects, between heart rate 
Ship = ion and reaction time. This relation- 
Varies гҹ be expressed by correlation and 
S from — .06 to — .42 if different task 


aree 
Sepa tions and respiratory phase are со 
сола Р? 10 averages out to — 27 over al 
resulte 5 for one entire experiment. These 
fle.. tS Only suggest that averaged data re- 
йн association between heart rate de- 
aci; “ON during the foreperiod and faster re- 


1( Н » K 
PN times; they do not necessarily support 
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an instrumental role of the heart rate re- 
sponse. 

The two papers presented in Denver in 
1966 (see Footnote 5) showed that the heart 
rate deceleration obtained was present under 
different stimulus conditions and that it was 
significantly enhanced by conditions designed 
to produce more motivation in the subjects. 

A number of investigators have examined 
the theory in a developmental context (e.g., 
Chase, Graham, & Graham, 1968; Lewis, 
Kagan, Campbell, & Kalafat, 1966; Steele & 
Koons, 1968; Steele & Lewis, 1968). A de- 
tailed analysis of these studies is beyond the 
scope of this review. However, it should be 
noted that investigators have reported heart 
rate decelerations occurring in a wide variety 
of tasks requiring the child's attention, rang- 
ing from simple stimuli eliciting orienting re- 
actions to conditioning and problem-solving 
tasks. The results from this area of inquiry 
provide some of the strongest supporting evi- 
dence and enthusiasm for the Lacey hypothe- 
sis, although practically none of these investi- 
gations examine the instrumental role of the 
cardiac response nor do they measure the 
response of hypothesized mediating mech- 
anisms.* 

Obrist and his collaborators have published 
a series of studies on reaction time and heart 
rate. Due to results of earlier studies on 
the relationships between electromyogram 
(EMG) and heart rate during classical con- 
ditioning procedures, these investigators have 
evaluated EMG, heart rate, and respiration 
during a reaction time task with a fixed fore- 
period (Obrist, Webb, & Sutterer, 1969; 
Webb & Obrist, 1970). A striking concomit- 
ance was found in each of the measures; 
normalized scores for EMG, respiration fre- 
quency, and heart rate showed an initial in- 
crease from base line with onset of the ready 
signal followed by a marked decline in all 
measures which continued up to the time 
the respond signal was given. Correlations be- 
tween heart rate change and respiration am- 
plitude, frequency and EMG varied from .25 
to .39 and the correlation between mean re- 
action time and amount of heart rate de- 


в For а more thorough treatment of this subject 
r see the recent review by Graham and Jackson 


matte! 
(in press). 
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celeration was .31. The EMG change and the 
respiratory frequency change correlated with 
mean reaction time, .42 and .37, respectively. 
It was also reported that the chin EMG 
appeared to reflect a more generalized form 
of inhibited motor activity, as it could be ex- 
tended to other EMG measures, and eye 
blinks and eye movements also decreased in 
frequency along with heart rate and EMG 
during the foreperiod. 

Generally, Obrist and his collaborators 
have consistently reported a positive relation- 
ship between EMG measures and heart rate, 
for example, both decreasing during the fore- 
period, and a negative correlation between 
these responses and reaction time, that is, 
reaction time performance improving or la- 
tency decreasing. The heart-rate-reaction- 
time relationship is, of course, consistent with 
the Lacey formulation, and the difference of 
opinion arises from different views about the 
functional or instrumental aspects of this 
relationship: 


Lacey (1967) has cited these studies as evidence 
for a feedback mechanism which related cardiac 
deceleration to cortical arousal via a reduction in 
baroreceptor activity. An alternative model . . . views 
the cardiac deceleration as part of a somatic inhibi- 
tory pattern [Webb & Obrist, 1970, p. 390]. 


In recent experiments these investigators 
have attempted to provide a critical test of 
the relaxation versus the inhibitory model by 
striving to block the deceleratory heart rate 
response. The reasoning seems to be that if 
the heart rate deceleration is not allowed 
to occur but the EMG decrease and enhanced 
reaction time performance are still obtained, 
the heart rate must not play a crucial role 
in instrumentally facilitating the reaction 
time performance. While earlier attempts did 
not supply the sought after differentiation 

because the only combination of variables 
which eliminated HR deceleration also elim- 
inated the inhibition of somatic measures 
[Webb & Obrist, 1970, p. 389],” a récent 
study utilizing pharmacological blocking of 
the heart rate response has produced the de- 
Sired results (Obrist et al., 1970). This study 
demonstrated that atropine blocking of the 
heart rate deceleration did not reliably alter 
reaction time performance, and the authors 
concluded that heart rate is not significantly 


involved in these relationships within the 
context of the design of this particular kind 
of study. In this experiment, the authors have 
provided an important test of the general 
somatic inhibition versus feedback models. 
While minor methodological problems exist 
(e.g., it is not certain that initial assumptions 
regarding the effects of various preparatory 
intervals on heart rate must be accepted, nor 
is it clear that the effects of atropine are 
evaluated as completely as possible), the 
major results contribute to a serious weaken- 
ing of the instrumental role of heart rate de- 
celeration in facilitating reaction time pe! 
formance. While these results weaken one 
major aspect of the Lacey hypothesis, they 40 
not invalidate the entire hypothesis; as we 
have pointed out above, and as Obrist et al. 
(1970) rightly conclude, the afferent dis- 
charges from the aortic and carotid Lam. 
ceptors have not been measured. It is stil 
possible that pulse-pressure changes were o 
curring at these sites independent of hear 
rate change, and these pressure changes ш 
have produced the reaction time facilitatto! 
observed. Nevertheless, these studies do raise 
serious questions about the utility of 95108 
heart rate as the only indicator of the by" 
pothesized facilitatory mechanisms and sug 
gest that if these mechanisms play a role 
such situations, they must do so indepe? 
dently of changes in heart rate. pis 
The series of studies by Obrist and 


à issue? 
collaborators have helped to clarify the 1881 


‘ {сай 
and to correct one popular oversimplifica 


а 
of the Lacey hypothesis—that heart be 
alone can be used to reflect the complex F ce 


sure changes at baroreceptor sites and о 
the release of inhibition at nTS. In "° et 
spect, it would seem that an investigator А 9 
ting out to test the hypothesis would not nent 
been content with less than a measure 
of the afferent output over the ninth Т5 
tenth nerves and/or activation of M 
mechanism; but the provocative early ing 
with heart rate, the limitations of М0; pak- 
with human subjects, and the difficulty етей 
ing even pulse-pressure measurements $ eart 
to encourage and proliferate the use a е ВУ 
rate as a principal determinant of 

pothesized mediating mechanisms. . — ps 

A second major of studies 


group 
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focused on the role of verbalization, а vari- 
i not originally treated by the Laceys (at 
not intentionally). These studies demon- 
ањ the importance of the effects of instruc- 
ons to verbalize on the heart rae response 
M point to the possibility of the confound- 
8 of verbalization with acceptance-rejection. 
(see Campos & Johnson, 1966; Campos & 
En. 1967; Edwards & Alsip, 1969; John- 
E ampos, 1967). In two of the studies, 
Mion ed failures to find directional fraction- 
sed AES not impressive because the stimuli 
Bd "E different than those used by Lacey 
tas oe the lack of directional fractionation 
chan ased on a finding of no significant 
rate Bes in either skin conductance or heart 
bn one study only 8 of 18 comparisons 
Som the .01 level of significance in either 
lon; in another, 6 out of 16), suggesting 
Don the stimuli being used. may not have 
mid апу reliable effect in the responses 
ured. 


Even 
VIDENCE FROM STUDIES ON RELATED ÍSSUES 


ut nh a number of studies, ur dei 
Tate wh volving classical conditioning 
ace: which have not been designed to test the 
tain Y hypothesis, but which do reflect on om 
a fae of the hypothesis. For examp e 
Tate “ing of a decelerative anticipatory heart 
In conditioning studies is consistent with 
the acey formulation (if one assumes ыы 
jeep. itioned stimulus, CS, calls the sub- 
$ attention to the environment), and is 
of ci tent with either à substitution visi 
tig, 2581са] conditioning or a view of activa- 
ассе] theory which would propose that an 
with fratory heart rate should be associate 
the fear or arousal associated with the 
lus гейш a noxious unconditioned stimu- 
may Ucs), Thus, the results of such studies 
Ous Provide additional evidence on the vari- 
ing Physiological reactions taking place dur- 
Ho foreperiod and, thereby, add informa- 
ni mı DONE the role of the mediating mecha- 
5 proposed by the Laceys. . 2 
ma Пе example of this kind of investigation 
бодре given. In a study of heart rate con’ 3 
batte in the cat (Hein, 1969), an erteni 
anq Cry of physiological measures was E E 
ИС басһ assessed during а CSF апі G ia 
assical conditioning format with shock 


used as the UCS. While marked variability 
in the several response measures occurred 
among the 20 experimental animals, all even- 
tually achieved a high degree of stability of 
the decelerative-conditioned heart rate re- 
sponse as well as differentiation of the CS+ 
and CS—. Some of the results relevant to 
mechanisms proposed by the Laceys were: (a) 
Blood pressure decreased along with the heart 
rate decrease in response to the CS. (5) There 
were several signs of activation concomitant 
with the conditioned decelerative heart rate 
response including activated cortical EEG, 
pupillary dilatation, and galvanic skin poten- 
tial. (c) Other concomitants of the CR were 
considered to be signs of parasympathetic 
activity or deactivation; these included heart 
rate deceleration itself, blocking of 40-50 
cycles per second spindles in the olfactory 
bulb, and muscular relaxation. While Hein 
does not relate his results to the Laceys’ hy- 
pothesis, it may have considerable import for 
this formulation. For example, here we see 
the often-neglected blood pressure measure- 
ment reflecting a fall during the foreperiod 
concomitant with cortical EEG activation just 
as Lacey has hypothesized," while at the same 
he muscular relaxation observed con- 
sistently by Obrist and his collaborators and 
at least one other sign of deactivation are also 
reported. Hein's (1969) comment that “. . . it 
is difficult to conceive of a single ‘center’ that 
could trigger off such a complex response . . . 
[р. 462]” might be generalized to provide a 
cautionary note for the whole controversy 
over the Laceys’ hypothesis. There is no rea- 
son why all of these responses cannot occur at 
one time, and there is no particular reason to 
focus on a single issue ог a. response Measure 
or two, such as only the baroreceptor reflex 
mechanisms or primarily the heart-rate-EMG 
coupling phenomena. Aside from an arti- 
ficially inflated valuing of parsimony, there 


time, t 


7 Hein (1969) says, “The blood pressure does not 
deceleration and thus the carotid 
sinus reflex is not implicated [p. 461]." Evidently, 
what is meant here is that an initial rise in blood 
pressure did not cause the heart rate deceleration 
through the carotid sinus baroreceptor compensatory 
reflex. This would not, presumably, imply that the 
concomitant decreases in heart rate and blood pres- 
sure could not participate in the cortical activation 


through carotid sinus reflexes. 


rise during the 
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appear to be no real advantages to concep- 
tualizing the complex processes that occur in 
situations like these in such simplified terms; 
on the other hand, the use of multiple mea- 
sures of central and peripheral physiological 
responses may provide a more accurate and 
more informative picture of the events taking 
place. 

Another link in the chain of mediating 
mechanisms is developed by study of the re- 
lationship between discrete phases of the 
cardiac cycle and various measures of corti- 
cal arousal or performance. There have been 
reports of a significant relationship between 
a specific phase of the electrocardiogram 
(EKG) cycle and the number of erroneous 
psychomotor responses (Callaway & Yeager, 
1960), auditory reaction times (Birren, Car- 
don, & Phillips, 1963), evoked responses (Cal- 
laway & Layne, 1964), and auditory thres- 
holds (Saxon, 1970). 

The complications introduced by intrinsic 
ties between preparatory interval and EKG 
cycle length has been the subject of intensive 
study (Thompson & Botwinick, 1970). In a 
series of four experiments, these investigators 
repeatedly find strikingly negative and insig- 
nificant results when the effects of the pre- 
paratory interval are controlled by various 
statistical or methodological techniques. The 
authors note that the discrepancy between 
their findings and earlier positive results may 
be due to the fact that a variable intertrial 
interval was used in their study or to the 
individual differences among subjects; but 
one must agree with Thompson and Botwinick 
that if the phenomenon is dependent on such 
fine adjustments of methodology, its utility 
and generality are of questionable value. 
Space limitations will not permit a more de- 
tailed analysis of this problem; however, it 
should be noticed that there is neurophysio- 
logical evidence which suggests that the num- 
ber of cells in nTS firing in concert with the 
EKG rhythm and the reticular concomitants 
of these effects may be far too small to ac- 
count for the presumed cortical activation. 
For the interested reader this theme is treated 
In more detail in a forthcoming paper.* 


D 


8 Presented in preliminary form as L. F. Delfini 
and J. J. Campos, Signal detection during four EKG 
Cycle phases." Paper read at the annual meeting of 


the Society for Psychophysiological Research, St 
Louis, October 1971. | | 
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One of the major difficulties encountered 
in attempting to test or evaluate the hypothe- 
sis centers on the use of the intervening уап- 
able of "attention." Not only are there in- 
herent definitional problems associated with 
this term, which is seldom adequately opera- 
tionalized, but in addition, there is evidence 
that effects of attention may be confounded 
by verbalization and arousal factors and there 
is a marked temptation to reinterpret the 
unmeasured attentional factors after results 
become known, It is unfortunate that this 
key construct is also one of the least testable 
and most poorly measured aspects of the 
theory. Others have observed this problem 
and appraised it in different ways; for ex- 
ample, Jennings, Averill, Opton, and Lazarus 
(1971), investigating several of the more 
popular variables affecting heart rate-per- 
formance relationships, came to the conclu- 
sion that attention, whether altered by task 
demands, motor demands, uncertainty, 0T 
noxiousness of the stimuli, seems to be the 
common intervening variable which deter 
mines the degree of heart rate deceleration 1" 
an experimental situation. If all of these 
factors represent aspects of attention, it might 
be one of those terms which has lost its s 
fulness through too broad a definition. At is 
logical extreme, virtually any stimulus РЁ” 
ceived by an experimental subject can be 
said to have attentional factors associat? 
with it, and therefore, any concomitant p 
rate change may be presumed to provide "t 
relevant to the Lacey hypothesis. Tt wou ч 
seem that clarification and more iria 
scribed and operationalized definitions of 5 
tention might help the hypothesis retain 1° 
heuristic value. 

A second important element has rem 
obscure throughout the 12-year history ? ast 
theory. That is, how does the entire t 
of heart rate deceleration, barorecePt ng 
flex, and cortical activation begin?" pe" pat 
the literature, one gets the impressio” е 
certain tasks or stimuli “reflexly” d et; 
heart rate acceleration or deceleration der" 
the heart rate reaction is supposed to mo К. 
i he author” 


ain ed 


° This issue was originally brought to t 
attention by Paul Obrist. 
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ate the subject's reaction to these stimuli. 
The temporal relationships among events and 
the mediational mechanisms at this input 
stage of the sequence have not been specified. 
One would suppose that an organism's initial 
reaction to an unfamiliar stimulus would be 
considerably different than that occurring 
later in the sequence after the subject is 
(presumably) more aware of the need to 
reject or accept the stimulus in question. In 
addition, it seems necessary to specify how 
the "appraisal mechanism," apparently a cog- 
nitive and cortical event, produces its effects 
on the cardiac mechanisms involved initially, 
before they, in turn, produce facilitating or 
modulating effects on the cortex. Given that 
the heart, itself, does not have the powers 
of perception, there must be some such first 
stage; and given the complexity of the human 
subject, it seems unlikely that various stim- 
uli—especially the very complex stimuli used 
in many of the studies cited herein—have 
innate cardiac patterns associated with them. 
Currently, several investigators are working 
оп this problem (see Headrick & Graham, 
1969; Wilson, 1969) and the Laceys appear 
to be approaching the issue in their most re- 
cent work in which the heart rate deceleration 
is being related to the contingent negative 
variation EEG component (Lacey & Lacey, 
1970). However, much greater detail on these 
formative mechanisms is needed, and a useful 
addition to the theory would be the elucida- 
tion of the proposed development of such 
heart rate reactions and of the results of 
Cortical effects on cardiac responses. 

A related difficulty in understanding the 
hypothesis stems from the use of the term 
"cortical activation." It seems that at one 
Point cortical activation occurs because heart 
rate decelerates. Later, the functions are re- 
Versed and heart rate decelerates because cor- 
tical activation is “needed” for a given task. 
Why does this cortical activation selectively 
facilitate sensory and sensory-motor tasks but 
'mpair performance of intellectual-associative 
Processes? One might suspect that cortical 
Activation would have varied effects on sen- 
Sory, motor, and emotional responses and 
farning depending on such things as history 
^f the organism, stimulus intensity and state 
Ог arousal level of the organism. An attempt 


at refinement and explication of cortical acti- 
vation and its relation to other organismic 
variables is another area which the hypothe- 
sis could be strengthened. 

A third central problem area is the instru- 
mental role of heart rate, the implications of 
causality, and the degree of importance that 
should be attached to this one response system. 
Although the Laceys appear to have avoided 
the term "instrumental" since a cautiously 
worded passage in the 1959 article, subse- 
quent offerings strongly imply that causality 
is being ascribed to the heart rate and blood 
pressure reactions. While there is little ques- 
tion that blood pressure through carotid sinus 
mechanisms caz influence cortical excitability 
(this is what the neurophysiological evidence 
seems to indicate), whether improved reac- 
tion time performance is primarily a result 
of these mechanisms being activated is an- 
other issue. As we have seen, only a small 
proportion of the variance can be attributed 
to these factors on the basis of current infor- 
mation, and the evidence suggests mainly 
that those subjects who show a slowed heart 
rate during the foreperiod have a slightly 
faster reaction time—to the best of our 
knowledge, no study has shown that an indi- 
vidual can systematically improve his reaction 
time performance by learning to slow heart 
rate more during the foreperiod. 

Heart rate is an exceedingly complex re- 
sponse, affected not only by the stimuli im- 
pinging on the organism (and all the possible 
variations in parameters of these stimuli) but 
also by task requirements, respiration, state 
of the organism, the species investigated, and 
changes due to repeated presentations of a 
stimulus (cf. Headrick & Graham, 1969; 
Jennings et al., 1971; Obrist et al., 1970). 
The Lacey hypothesis appears to have over- 
looked or ignored many of these important 
determinants of heart rate and performance 
variables with the result that many findings 
are not consistent with the notion that heart 
rate deceleration instrumentally facilitates 
performance through cortical excitation. The 
interaction of several of the relevant factors 
is demonstrated in a recent study in which 
task variables, preparatory intervals, atten- 
tion, and noxiousness were all varied and the 
authors report that “all experimental manip- 
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ulations appeared to contribute to cardiac de- 
celeration ... [Jennings et al., 1971].” 

It is clear from this examination of the 
hypothesis that it will take an exceptionally 
clever research design to operationalize and 
differentiate the input variables, extremely 
precise instrumentation to evaluate the posu- 
lated mediating mechanisms, and very de- 
tailed parametric studies to relate these fac- 
tors individually to the performance variables. 
It seems unlikely that, in the near future, 
we will be able to satisfactorily unravel di- 
mensions of attention, acceptance, and pleas- 
antness from each other or obtain an accurate 
measure of total baroreceptor output. 

Nevertheless, the hypothesis has made 
some important positive contributions to the 
development of psychophysiological research. 
Perhaps the most important of these has been 
the role the hypothesis has played in en- 
couraging modification of activation theory. 
Just as advances in neurophysiology necessi- 
tated a change of conceptualization of the 
Ascending Reticular Activating System from 
a functionally and anatomically undiffer- 
entiated network of activating cells to a 
markedly specialized and dynamic activating 
and inhibiting system, so has the Laceys’ 
emphasis on the inverse correlation between 
heart rate and arousal helped to produce a 
modification of the earlier activation positions 
of Duffy (1951) and Malmo (1959). Who 
knows how long the “inconsistent” decelera- 
tive heart rate findings could have been over- 
looked had not the forceful leadership of the 
Laceys called attention in a very provocative 
and positive way to this phenomenon? In no 
way do these advances spell the end of acti- 
vation theory or invalidate it (any more than 
B eds gera of Ше reticular formation 
now clear that the el aee c c 

ien een e correlations among 
Predicted with mec e VAE) measures 
ways occur if we work Н, е 
the arousal continuum; h з engl mee 

: ; here, one must expect 
more complicated and subtle interactions 
among the various arousal and performance 
measures investigated. 

Focus on the decelerative heart rate re- 
Sponse in the context of the Laceys? hypothe- 
sis may also have made this specific response 


more generally acceptable or recognizable. 
From 8 to 10 years ago, there was a bewilder- 
ing array of conflicting results on the heart 
rate response to different situations, including 
classical conditioning (cf. McDonald, Stern, 
& Hahn, 1963; Shearn, 1961) perhaps partly 
because the decelerative anticipatory heart 
rate response did not "fit" with a stimulus 
substitution view of conditioning or with the 
activation that would be presumed to be pres- 
ent during a CS. More recently, the converg- 
ing trends of investigations of the orienting 
response and heart rate during the foreperiod 
of reaction time studies, improved methodol- 
ogy, and more careful evaluation of the heart 
rate response are producing growing consensus 
about the nature of the conditional heart rate 
reaction. 

A further contribution of the Lacey formu- 
lation involves the emphasis it has placed on 
the transactional nature of ANS responses. 
As Averill, among others has pointed out, 
the well-known conceptualization of the ANS 
as a primitive, unresponsive system is steeped 
in the psychophysiological symbolism of an- 
tiquity, and with a few exceptions (cf. Stern, 
1968), it was rare to ascribe important in- 
strumental attributes to this presumably re- 
flexive and untalented portion of the nervous 
system. The Laceys’ elevation of one single 
ANS response system to the status of a major 
mediating mechanism playing an important 
instrumental role in the performance of а? 
organism antedated the explosion caused by 
Neal Miller and his colleagues upon introduc 
tion of their impressive work on instru- 
mental control of visceral responses. Only 
recently, in fact, has this work presented data 
on the possible significance of the heart rate 
response as an important mediating mech- 
anism for emotional and performance attri" 
butes of the organism (DiCara & Weis? 
1969), work which may eventually provide 
an important link with, and perbaps sil 
advantage of, the theory and data generate 
by attempts to evaluate the hypothesi 
Lacey and Lacey. 


7^J. R. Averill, Emotion and visceral ac 
А case study in psychophysiological 
Paper presented at the Society for Psychophys 
cal Research, Monterey, Calif., October 1969. 
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TESTING IF TWO MEASURING PROCEDURES 
MEASURE THE SAME DIMENSION! 


FREDERIC M. LORD? 


Educational Testing Service 


A convenient statistical technique is described for testing the hypothesis that 
two sets of measurements differ only because of errors of measurement and 
because of differing origins and units of measurement. 


This note is concerned with testing the 
hypothesis that two sets of measurements 
differ only because of (a) errors of measure- 
ment, (b) differing units of measurement, 
and (c) differing arbitrary origins for mea- 
Surement. It describes a convenient statistical 
technique that, unnoticed, has become avail- 
able (Villegas, 1964). It will probably be 
preferred to other techniques used for this 
purpose (Forsyth & Feldt, 1969, 1970; Lord, 
1957; McNemar, 1958), though each has 
advantages and disadvantages. 


STATISTICAL PROCEDURE 


The necessary raw data consist of 7 >2 
Teplicate measurements (indexed by the sub- 
Script &) on each of Y people (indexed by a) 
?Y each of two measuring instruments or pro- 
cedures (indexed by g or #). For each instru- 
Ment or procedure separately, compute the 
Usual among-persons and within-persons sums 
of squares. Also, compute the corresponding 
sums of cross products between instruments 
(or procedures). Let W denote the 2 x 2 
matrix of within-persons sums of squares and 
cross products, and let 4. denote the corre- 
sponding 2 X 2 among-persons matrix. In a 
Standard notation, the element of W in col- 
umn g and row A is 


N ж 
2 Ж, (Жок = Won) nae = nz): 


а=1 k=l 
эл: " еа ; 
| This research was sponsored in part by the Per- 
PE and Training Research Programs Psychologi- 
1 aval Research 


= Sciences Division of the Office of N 
nder Contract No. N00014-69-C-0017 and Contract 
Deed Identification NR 150-303 and by Educa- 
Dart; ©ЗШЕ Service, Reproduction in whole or in 
State 15 permitted for any purpose of the United 
э 05 Government, 

M, 340515 for reprints should be sent to Frederic 
tion eee Division of Psychological Studies, Educa 

esting Service, Princeton, New Jersey 08540 


A typical element of A, similarly, is 


md 
r$, (Ny. = x) (xia. — xh.) 


а=1 


Let Fp denote Ше 1 — p percentile of the 
F distribution with N and V(r — 1) degrees 
of freedom, for the numerator and denomi- 
nator, respectively. Compute the matrix: 


M = (r — 1)A — F,W- 


From the product of the two diagonal terms 
of M subtract the product of the two off- 
diagonal terms, thus obtaining the value of 
the determinant |M]. If the determinant is 
positive and if both diagonal terms are also 
positive, then the F test rejects at significance 
level < p the hypothesis Ho stated at the 
beginning of this paper. Otherwise, the hy- 
pothesis is not rejected. This same conclusion 
may be stated more compactly as follows: 
Hy is rejected if and only if the matrix Л/ 
is positive definite. 


Discussion 


The assumptions required for the validity 
of the foregoing statistical procedure and 
conclusions are as follows. Within each repli- 
cation, the errors of measurement for instru- 
ments g and Z for each person always have 
à bivariate normal distribution, The mean 
error is always taken to be zero. No restrictive 
assumptions are made about the variance or 
correlation. parameters of this distribution, 
except that the variances are positive. In 
particular, the errors for g and л may be 
correlated within a replication and within a 
person. The bivariate distribution is the same 
for each replication and for each individual 
measured. For two replications or two Persons, 
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TABLE 1 


DATA FOR SPEEDED AND UNSPEEDED 
ABULARY Т s 


Unspeeded 
test SS 


Sum of 
product- 


Group 


93524.56 76170.30 111220.5% 
533.00  — 547.00 15403.50 
56 75029.30 120028 


Among persons 
Within persons 18. 
Total 


the errors are always uncorrelated with each 
other, 

The unknown relationship between the two 
origins mentioned in Ho, also that between 
the two units of measurement, represent 
“nuisance parameters" for the purpose of 
making a significance test of Ho. Villegas 
(1964) shows that when (and only when) M 
is positive definite, there is no set of values 
for these (unknown) nuisance parameters 
such that an appropriate variance ratio cal- 
culated from them and from the data will 
lie below the p level of significance. 

The significance level in the foregoing 
statement is precisely p. However, when the 
entire procedure is considered as a test for 
Hy, it is known only (assuming that the repli- 
cate measurements are not perfectly corre- 
lated) that the significance level of the test 
will be less than p. Thus, the significance test 
is conservative in the sense that H, will be 
rejected somewhat less often than the p value 
would indicate. It is important in practice 
that the replications meet the assumptions 
stated. Any practice effect between replica- 
tions that increases the within-persons sums 
of squares, for example, will tend to decrease 
the chance of rejecting H,. 


? The author is indebted to Murray Aitkin and to 
Leon Gleser for pointing out and clarifying thi- 
distinction, See also Saw (1966, Sections 2-3) 


NUMERICAL EXAMPLE 


In the numerical example, measuring in- 
struments g are unspeeded 15-item vocabu- 
lary tests; measuring instruments hare highly 
speeded 73-item vocabulary tests. The r — 2 
parallel forms of each test were administered 
to each of № = 649 examinees. The raw data 
are the same as those used in a previous 
numerical example (Lord, 1957, pp. 210- 
212). The required sums of squares and 
products are shown in Table 1. 

The .05 significance level of F for 649 and 
649 degrees of freedom is 1.13. The matrix 
M is easily found (to the nearest integer) 


to be 
72582 76794 
Ke 93815 


76794 
The determinant is positive, so Ho is re 
jected. This agrees with the conclusion 
reached previously by large-sample methods 
(Lord, 1957) under somewhat different as- 
sumptions. 
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SELF-DISCLOSURE: 


A LITERATURE REVIEW ! 


PAUL C. COZBY ? 


University of Minnesota 


Literature relating to the verbal disclosure of information about oneseli is 


reviewed. First, the role of self-disclosure as 
sidered. It is suggested that the failure of res 
tionships between self-disclosure and other personality 


poor development of paper-z 
selves do not relate to actual di 
ested in personality correlates of 
disclosure, The role of self-c 
sidered, and the principle va 
reviewed, However, it is sug: 


ed 


research. The role of 


a personality attribute is con- 
archers to find consistent rela- 
iables is due to the 
sasures of disclosure which them- 
suggested that researchers inter- 
sures of 


closure in interpersonal relationships is con- 
ables that 
that 
such as needs for privacy, have not been adequately treated in dis 
self-disclosure 


appear to facilitate disclosure are 
factors inhibit intimate disclosure, 
sure 
interviewer: 


in experimenter-subject, 


interviewee, and therapist-client relationships is reviewed. Finally, the review 
considers the relationship between self-disclosure and nonverbal aspects of 


communication. 


Self-disclosure may be defined as any in- 
formation about himself which Person A 
Communicates verbally to a Person B. The 
term “self-disclosure” has been used by 
Jourard (e.g. 1964), although other terms 
such as “verbal accessibility" (e.g., Polansky, 
1965), and “social accessibility" (ел, 
Rickers-Ovsiankina, 1956) have been used to 
describe the same concept. "Seli-disclosure" 
is used in the present paper because (a) it is 
the most commonly used term in the litera- 
ture and (b) the term refers to both a per- 
Sonality construct and a process which occurs 
during interaction with others. 


! Preparation of this review was supported by a 
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MEASURING SELF-DISCLOSURE 

Jourard's. Sclf-Disclosure Questionnaire 

The most widely used instrument to assess 
individual differences in self-disclosure has 
been Jourard’s Self-Disclosure Questionnaire 
(JSDQ). The initial instrument, described 
by Jourard and Lasakow (1958), consisted 
of 60 items—10 items in each of six content 
areas: attitudes and opinions, tastes and in- 
terests, work (or studies), money, personal- 
ity, and body. Subjects responded to each 
item by indicating the extent to which the 
information has been revealed to four target 
persons: Mother, father, best opposite-sex 
friend, and best same-sex friend. Items were 
scored as 0—no disclosure to the target person, 
1—disclosure only in general terms, or 2— 
full and complete disclosure about the item. 

Pederson and Higbee (1968) point out that 
subsequent research has employed variations 
of the JSDQ that differ on a number of di- 
mensions, including length of questionnaire, 


target persons, instructions, and n3» ‘the 
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items. Despite such differences (which are 
described at length by Pederson & Higbee, 
1968), there has been a tacit assumption in 
the literature that the various measures are 
equivalent. 

Validity of the JSDQ. Pederson and Higbee 
(1968) obtained evidence for convergent and 
discriminant validity of the 60- and 25-item 
JSDQ by means of multitrait-multimethod 
matrices (Campbell & Fiske, 1959), but there 
was also evidence for variation between the 
two methods of measuring disclosure. The 
JSDQ appears to be independent of intel- 
ligence (Jourard, 1961d; Halverson & Shore, 
1969: Taylor, 1968) providing evidence for 
discriminant validity of the JSDQ. 

Jourard (1961c) reported that productivity 
on the group Rorschach correlated .37 
(p < .05) with total score on the JSDQ, and 
interpreted the finding as supportive of the 
validity of the instrument. 

However, there is little evidence for the 
predictive validity of the JSDQ. Researchers 
have been unable to find a relationship be- 
tween the JSDQ and actual disclosure in a 
situation (Ehrlich & Graeven, 1971; Himel- 
stein & Kimbrough, 1963; Lubin & Harrison, 
1964; Vondracek. 1969a, 1969b) or ratings of 
actual disclosure made by peers (Himelstein 
& Lubin, 1965: Hurley & Hurley, 1969). 
Pederson and Breglio (1968b) did find that 
amount and intimacy of disclosure on written 
self-descriptions were correlated with total 
Scores on the 60-item JSDQ, but not on the 
25-item JSDQ. Burhenne and Mirels (1970) 
found that rated disclosure on written self- 
descriptions correlated exactly .00 with the 
JSDQ. 

Tt is clear that the JSDQ does not accu- 
rately predict actual self-disclosure. The ex- 
planation perhaps lies in the fact that scores 
on the JSDQ reflect subjects’ past history of 
disclosure to parents and persons who are 
abeled "best same-sex friend” and “best 
opposite-sex friend." When actual disclosure 
is measured, the subject is disclosing to an 
experimenter or to peers whom the subject 
nas never met. 

Jourard and his students have recently 
egun using a 40-item questionnaire which 
asks subjects to indicate what they have dis- 
closed to someone in the past, and what they 
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would be willing to disclose to a stranger of 
the same sex (see Jourard, 1969, Appendix 1, 
for a description of the questionnaire). This 
questionnaire has been shown to predict 
actual disclosure in studies by Drag? and 
Jourard and Resnick (1970). These studies 
also indicate that situational variables may 
outweigh individual differences in disposition 
to disclose. 

The problem of the validity of the JSDQ 
is discussed further after the review of at- 
tempts to relate personality variables to self- 
disclosure. 


Other Measures of Self-Disclosure 


Several investigators have developed other 
measures of self-disclosure. Since these mea- 
sures have not been widely used, or were 
developed for specific types of subjects, they 
are discussed only briefly. 

1. West and Zingle (1969) describe a Self- 
Disclosure Inventory for Adolescents. 

2. Vondracek and Vondracek (1971) have 
developed a system for scoring self-disclosure 
by preadolescents in interview situations. 

3. The Social Accessibility Scale consisting 
of 25 items has been described by Rickers- 
Ovsiankina (1956), and a 50-item version !5 
described by Rickers-Ovsiankina and Kusmin 
(1958). The scale differs from the JSDQ 1n 
that (a) subjects are instructed to indicate 
what they would disclose rather than what 
they Lave disclosed and (b) the target pet 
sons are a stranger, an acquaintance, and à 
best friend. Pederson and Higbee (1968) 
correlated the 60- and 25-item JSDQ with 
the 50-item Social Accessibility Scale, and 
concluded that the two measures should not 
be considered equivalent, as has been done 
previously (cf. Jourard, 1964). 

4. Polanskv's (1965) concept of “verbal 
accessibility" has been measured by the 1° 
complete sentence method, degree of agre 
ment with such statements as “the really 
smart guys keep their opinions to ae 
selves,” and reports of caseworkers (Polans*: 
& Brown, 1967). 


ction: ^ 


51, R. Drag. Experimenter-subject intera 
situational determinant of differential levels of ч 
disclosure. Unpublished thesis, Univer 
of Florida, 1968. 


ity 
master's 
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5. Taylor and Altman (1966) scaled 671 
Statements. for intimacy value and topical 
category, Although these statements do not 
Comprise a disclosure questionnaire, the state- 
ments can be used to construct a question- 
naire, The previously discussed measure cur- 
rently being used by Jourard (1969) uses the 
Taylor and Altman statements. A 144-item 
questionnaire using the statements, which 
uses “best friend” as the only target person, 
Correlates .25 (2 < .05) with actual dis- 
closure in an interview situation (Vondracek 
& Marshall, 1971). Although significant, this 
Correlation is not impressive. It would be 
interesting to know if such a correlation 
Would be raised by including "stranger" as a 
larget person. The Statements may also be 
Used as stimulus materials for experimental 
manipulations, and as a means of scoring 
speech content for self-disclosure. 


PARAMETERS OF SELF-DISCLOSURE 


The basic parameters of self-disclosure are 
(а) breadth or amount of information dis- 
closed, (0) depth or intimacy of information 
disclosed, and (c) duration or time spent 
describing each item of information. 

In actual disclosure, intimacy and duration 
Of disclosure appear to be partially indepen- 
dent, with a correlation of -42 between the 
two variables (Vondracek, 1969a). There is 
an inverse relationship between amount and 
intimacy of disclosure such that individuals 


disclose less about more intimate topics 
(Altman & Haythorn, 1965: Fitzgerald, 
1963; Hood & Back, 1971; Jourard & 


Lasakow, 1958; Taylor, 1968; Taylor, Alt- 
man, & Sorrentino, 1969; Tuckman, 1966), 
and the effect appears to hold across cultures 
(Brown, 1965; Melikian, 1962). 

Little work has been done on the content 
of information disclosed, although there is 
Some research on the positive or negative 
aspects of the information. Levin and Gergen 
(1969) found that individuals disclose more 
Positive aspects of themselves when trying to 
€ ingratiating than when trying to be honest, 

Ives tend to communicate more unpleasant 
“clings, including anxieties and worries, but 

= More pleasant feelings than do husbands 
leri Goldston, Cohen, & Stucker, 1963; 

Rer & Senn, 1967). 


SELF-DISCLOSURE AS А PERSONALITY 
Construct 


Self-Disclosure and Family Patterns 


One question one may ask when dealing 
with personality variables is what sorts of 
child-rearing practices and family interaction 
patterns are associated with variations in the 
personality variable in adulthood, 

Dimond and Munz (1967) found and later 
replicated (Diamond & Hellkamp, 1969), 
that later borns show higher self-disclosure 
scores than firstborns, They concluded that 
while firstborns may have greater affiliative 
needs than later borns, they are not as adept 
at satisfying them, at least by establishing 
close relationships with others, 

Jourard (1961b) found that Jewish males 
were significantly higher in disclosure than 
Baptists, Methodists, and Catholics, none of 
which differed significantly from one another, 
However, Low? did not find a difference be- 
tween Jewish and Protestant groups. Cooke * 
obtained a correlation of .22 (P < .05) be- 
tween disclosure to parents and religious 
behavior (e.g., irequency of church attend- 
ance) of protestants. Jennings (1971) did not 
find a relationship between disclosure and 
whether religious beliefs are "liberal" or 
"conservative." Presumably, any effects of re- 
ligious background on disclosure are mediated 
by a closer family orientation, 

A recent study by Pederson and Higbee 
(1969b) found that disclosure to parents was 
correlated with subjects’ ratings of parents on 
such adjectives as close, warm, friendly, and 
accepting. In addition, it was found that 
females who rated the mother as cold, dis- 
trustful, and selfish tended to score high on 
the Social Accessibility Scale which measures 
willingness to disclose to Strangers, acquaint- 
ances, and/or best friends, Thus, it seems 
possible that family relationships аге more 
important in determining who a person dis- 

1A. B. Low. Sex and religious differences in self- 
disclosure as determined by a revised method ot 
scoring the “Jourard Selí- Disclosure Questionnaire.” 
Unpublished masters thesis, University of Florida, 
1964, 

5 Т. F. Cooke. Interpersonal correlates of religious 
behavior. Unpublished doctoral dissertation, Univer- 
sity of Florida, 1962. 
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closes to rather than whether or not the 
person will be a high discloser. Such an 
interpretation is supported by Doster and 
Strickland's (1969) finding that, while, in 
general, high disclosers perceive their parents 
as more nurturant than low disclosures, sub- 
jects from the low-nurturant homes disclose 
more to friends than parents while the reverse 
is true with subjects from high-nurturant 
families. 


Sex, Race, and Cultural Factors in 
Self-Disclosure 


Sex differences. Jourard and Lasakow 
(1958) reported that females have higher dis- 
closure scores than males. This effect has 
been replicated in numerous investigations 
(Dimond & Munz, 1967; Himelstein & Lubin, 
1965; Hood & Back, 1971; Jourard & Lands- 
man, 1960; Jourard & Richman, 1963; 
Pederson & Breglio, 1968a; Pederson & 
Higbee, 1969a). Jourard (1964, Ch. 6) at- 
tributed considerable importance to the ob- 
tained sex differences. The low disclosure of 
males was seen to be directly associated with 
less empathy, insight, and a shorter life-span 
than females. However, a number of studies 
have reported no sex differences in self- 
disclosure (Dimond & Hellkamp, 1969; 
Doster & Strickland, 1969; Low, (see Foot- 
note 4); Plog, 1965; Rickers-Ovsiankina & 
Kusmin, 1958; Vondracek & Marshall, 1971; 
Weigel, Weigel, & Chadwick, 1969; Zief*). 

Jourard (1964) and Plog (1965) have sug- 
gested that conflicting findings concerning sex 
differences may be the result of samples from 
different geographical areas with a concomi- 
tant difference in sex role expectations. Sex 
differences would be found in the southern 
United States, but not in the northern and 
western United States, However, an examina- 
tion of the studies which tested for sex dií- 
ferences yields no consistent pattern which 
would allow the conflicting results to be 
explained by differences in geographic locale 
or type of instrument used. 

The fact that no study has reported greater 
male disclosure may be indicative of actual 
sex differences. The nature of any sex dif- 
E: К. M. Ziel. Values and self-disclosure. Unpub- 
lished honors thesis, Harvard University, 1962. 
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ferences might be found if researchers were 
to pay greater attention to the types of items 
which reliably discriminate between males 
and females, and types of situations in which 
males and females would or would not differ 
in disclosure output. For instance, Pederson 
and Breglio (1968a), using written self- 
descriptions, found that females did not use 
more words to describe themselves than 
males, but they disclosed more intimate 
information about themselves than males. 

Race, ethnic group, and social class factors. 
Jourard and Lasakow (1958) reported less 
disclosure by blacks than by whites. This 
finding has been replicated by Dimond and 
Hellkamp (1969), and by Littlefield? who 
also reported less disclosure by Mexican- 
Americans than by blacks. There is some evi- 
dence that racial differences in self-disclosure 
may be due to social class factors. Jaffee and 
Polansky (1962) found no differences in dis- 
closure between lower-class blacks and lower- 
class whites. Mayer (1967) had reported that 
middle-class women disclose more about their 
marital difficulties than do working-class 
women. 

Tulkin (1970) presents data from an on- 
going study which indicate that middle-class 
mothers initiate a greater number of verbaliza- 
tions and respond to a greater percentage of 
the child's vocalizations than lower-class 
mothers. These results suggest that the rein- 
forcement patterns that lead to variations in 
self-disclosure in adulthood may be operating 
as soon as the infant begins to babble. 

Cross-cultural findings. Lewin (1948) ob- 
served that Americans disclose a great deal 
about themselves and make friends easily, 
but do not develop highly intimate relation- 
ships, while Germans don't disclose much 
about themselves to others in general, but 
they do become quite intimate with a few 
others. Plog (1965) obtained evidence for the 
hypothesis that Americans disclose more than 
Germans. However, the hypothesis that Ger- 
mans would disclose more than Americans (0 
a close friend of the same or opposite Se 
was not supported. 


TR. P. Littlefield. An analysis of the self-disclosure 
patterns of ninth grade public school children in 
three selected subcultural groups. Unpublished doc- 
toral dissertation, Florida State University, 1968. 
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TABLE 1 


SiubiEs WHICH EXAMINED. THE RELATIONSHIP BETWEEN SELE-DISCLOSURE. AND Mental HEALTH 
Subjects | 
Author and date — — — | Measure of mental health 
Sex | Age | n 
Positive relationships between disclosure and mental health 
ححح‎ — —— — — 7 — — — = 
Mayo (1968) F | Adult | 80 Neurotic inpatients compared with 
| normals 
Taylor, Altman, & Frankfurt (see M | 17-20% | 100 | MMPI Hy, Is, Sc, Pd 
Footnote 8) | | 
Pederson & Higbee (19694) | r | CS | 51 Pederson Personality Inventory Cy- 
| 
| 


relationships between disclosure and mental health 


cloid disposition scale 
| 


Pederson & Higbee (19694) M CS | 56 | Pederson Personality Inventory Cy- 
| | cloid disposition scale 

Pederson & Breglio (1968a) F CS | 26 | Cycloid disposition scale 

Truas & Wittmer (in press) M/F Cs | 89 MMPI adjustment profile 

Persons & Marks (1970) M 18-22 | 81 ММРЇ Р, Pd, Pa, Sc 

Cozby (1972) F CS 30 Ratings of hypothetical stranger dis- 


| 
| 
| 


Stanley & Bownes (1966) 


Pederson & Breglio (1968a) 


Jourard (1964, p. 181) 


No relationship between disclosure and menta 


CS 


CS 


closing at low, medium, and high 
levels of intimacy 


il health 


137 Maudsley Inventory 
neuroticism sc 
26 | Pederson Personality Inventory Cy- 


cloid disposition scale 
Applicants for clinical services 


Note. Abbre ions: CS = colle: 
^ Naval reer 


b Prison inmates, 


Self-Disclosure and Mental Health 


Jourard (1959a) argued that the ability to 
allow one’s real self to be known to at least 
one "significant" other is a prerequisite for 
a healthy personality. Concerned with the 
Concept of self-actualization (Maslow, 1954), 
Jourard proposed that low disclosure is 
indicative of a repression of self and an 
inability to grow as a person. Jourard (1964) 


elaborated on the selí-disclosure-mental- 
health relationship, and suggested that the 
relationship between the two variables is 


Curvilinear, Although this brief summary does 
10е justice to Jourard’s (1959a, 1963, 1964) 
Views, it serves as an introduction to research 
investigating the relationship between self- 
disclosure and mental health. 

Ourard’s writings indicate that disclosure 


should be negatively related to “ 
maladjustment and also positively to “posi- 
tive” mental health (e.g., self-actualization), 
Table 1 summarizes the studies which have 
looked at the former type of mental health. 
With regard to the studies that support 
Jourard's initial hypothesis, it should be 
pointed out that the magnitude of the cor- 
relations are small: .28 in the Pederson and 
Higbee (1969a) study, and .18 to .34 in the 
Taylor, Altman, and Frankfurt § study, and 
these authors report that an attempted 
replication yielded even lower correlations. 
Although it is difficult to identity the cri- 
teria that should be used to judge whether a 


clinical” 


*D. A. Taylor, I. Aliman, & L. P. Frankfurt. 
Personality correlates of self-disclosure, Unpublished 
manuscript, 1965. 
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TABLE 2 


Srupies WHICH EXAMINED SELF-DIsCLOSURE AND “POSITIVE” MENTAL HEALTH 


Subjects | | 3 
Author & date اسا‎ Measure of mental health And 
me] ax | = | 
Vargas (sce Footnote 9) M CS 277 | Positive experiencing and behaving scale positive 
Halverson & Shore (1969) M/F | Adult 53 | Conceptual complexity, interpersonal flexi-| positive 
bility, general adaptibility 
Fitzgerald (1963) F CS 300 | Self-esteem index ns 
Doster & Strickland (1969) M/F CS 110 | Marlowe-Crowne Social Desirability scale ns 
Burhenne & Mirels (1970) F cs 56 | Marlowe-Crowne Social Desirability scale ns 


Note. Abbreviations: CS = college students. 
a Peace Corps trainees, 


study is testing the hypothesis that disclosure 
is related to positive mental health (cf. 
Smith, 1959), the studies listed in Table 2 
seem at least somewhat appropriate. The 
Vargas? study which measured self-actualiza- 
tion is perhaps the most appropriate. How- 
ever, our confidence in such a relationship 
would be increased by a study relating dis- 
closure to a more standardized measure of 
self-actualization such as the Personal Ori- 
entation Inventory. 

Is self-disclosure related to mental health? 
Virtually every type of relationship has been 
reported in the literature, yet no correlation 
reported has been greater than .50, and most 
are much lower. The curvilinear relationship 
suggested by Jourard (1964) seems intui- 
tively compelling for several reasons. The 
individual who never discloses may be unable 
to establish close relationships with others. 
A large portion of his self may be seen as 
threatening and is repressed (there is evidence 
that repressors talk less than sensitizers— 
Kaplan, 1967; Axtell & Cole, 1971—but it 
is not known whether repressors are also less 
intimate). In contrast, the individual who 
characteristically discloses a great deal about 
himself, not just to someone close such as a 
parent or spouse, but to anyone, may be 
perceived by others as maladjusted and also 
unable to relate to others because of a pre- 
occupation with his own self. The medium 
discloser may disclose a great deal to some- 


э В. Vargas. A study of certain personality charac- 
teristics of male college students who report frequent 
positive experiencing and behaving. Paper presented 
at the meeting of the Southeastern Psychological 
Association, New Orleans, February 1969. 


one who is very close, and generally main- 
tains a moderately close relationship with 
others (i.e., not too close to be offensive, but 
close enough to establish meaningful social 
bonds). If the relationship between self- 
disclosure and mental health is actually curvi- 
linear, there are two possible reasons for the 
conflicting results. First, none of the studies 
previously cited reported computation of a 
correlation ratio. Significant departures from 
linearity thus may have gone undetected. It is 
also possible that true linear relationships were 
obtained because of biases in the sample. A 
positive relationship between disclosure and 
mental health would be found with samples 
that contained ‘no high disclosers, while @ 
negative relationship would be found in а 
sample that contained no low disclosers. The 
author would suggest the following hypothe- 
ses: Persons with positive mental health 
(given that they can be identified) аге 
characterized by high disclosure to а few 
significant others and medium disclosure to 
others in the social environment. Individuals 
who are poorly adjusted (again assuming 4 
suitable identification can be made) are 
characterized by either high or low disclosure 
to virtually everyone in the social environ- 
ment. Future research should help to clarify 
this rather confused aspect of self-disclosure- 


Personality Correlates of Self-Disclosure 


Studies which have examined personality 
correlates of disclosure are shown in Table 2 а 
Like the previously cited studies on dis- 
closure and mental health, correlations 1F 
volving self-disclosure scores and various pel 
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TABLE 3 


STUDIES WHICH EXAMINED. PERSONALITY CORRELATES OF SELF-DiscLosuRE 


Subjects 
Author & date 2828 Tel Personality measure used IS 
Ses |. Age | n 
9и E | SEE — = — 
Femininity 
Taylor, Altman, & Frankfurt (see | M | 17-230 | 100 | MMPI f | positive 
‚ Footnote 8) | | | Guilford-Zimmerman positive 
Swensen (see Footnote 11) = = | — | Guilford-Zimmerman | positive 
Pederson & Breglio (1968a) M/F CS | 52 | Gough femininity ns 
Pederson & Higbee (19693) M/F | CS | 107 | Gough femininity ns 
Authoritarianism 
Taylor, Altman, & Frankfurt (see | M | 17-20% 100 | Rokeach Dogmatism | positive 
Footnote 8) | | 
Halverson & Shore (1969) M/F Adult 53 F scale | negative 
Worthy, Gary, & Kahn (1969) F | 09 48 | Rokeach Dogmatism | ns 
Sociability and Extraversion 
Taylor, Altman, & Frankfurt (ѕее M | 17-20 100 , Edwards Personal Preference positive 
l'ootnote 8) | | Schedule affiliation, succor- 
| ance, nurturance, hetero- 
| | sexuality | 
MMPI Si negative 
Tuckman (1966) M 17-24^ | 299 | Systems of integrative com- positive 
T = plexity 
Taylor & Oberlander (1969) M X = 18.5 | 32 | Accuracy in a facial recognition positive 
, Experiment 1 task 
Swensen (see Footnote 11) — = == Maudsley Personality Inven- positive 
lory extraversion 
Swensen (sce Footnote 11) — — | — Guilford-Zimmerman social positive 
| interest—sociability, ascend- | 
| ànce— social boldness | 
Frankfurt (see Footnote 10) 1 CS | 60 | Guilford-Zimmerman socia- — | ns 
| bility, thoughtfulness, per- | 
| | sonal relations | 
Pederson & Breglio (19682) M/F CS | 52 | Pederson Personality Inven- ns 
| їогу extraversion scale | 
Pederson & Higbee (1969a) M/F CS 107 | Pederson Personality Inven- ns 
tory extraversion scale 
College achievement 
Powell & Jourard (1963) M/E CS 80 | Grades | ns 
Jourard (1961d) E || CS 23 | Grades in nursing courses positive 
Grades in nonnursing courses ns 
A MN A 
Interpersonal trust 
rr -= == ج‎ ee 
Vondracek & Marshall (1971) M/I ES | 106 | Rotter Interpersonal Trust ns 
| scale 
г Ер 
ope Abbreviations: CS = college students. 


D Peel recruits, 
ce Corps trainees, 
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sonality measures have been generally low 
and often contradictory. 

The pattern of results involving measures 
of sociability and extraversion do seem con- 
sistent, and perhaps one may have confidence 
in a positive relationship between disclosure 
and extraversion. However, the studies by 
Frankfurt and Pederson and his associates 
are somewhat contradictory. 

It is clear that the personality character- 
istics associated with self-disclosure are not 
well understood. Altman and Taylor (1973) 
feel that it is unrealistic to expect to find 
specific trait-disclosure relationships. Their 
approach is to explore the relationship be- 
tween personality and self-disclosure in the 
context of specific relationships and settings, 
and is considered below. 

It is also likely that the use of the JSDQ 
is a factor in the generally low correlations 
between disclosure and various personality 
traits (including mental health). It was pre- 
viously mentioned that the JSDQ does not 
appear to predict actual disclosure to others. 
The JSDQ may be best interpreted as a 
measure of past history of disclosure. Some 
evidence of the subject's present disposition 
to disclose may be inferred from the subject's 
reports of disclosure to friends. Swensen ™ 
has reported that the highest personality 
trait-JSDQ correlations involve peers as 
target persons. Perhaps a more sensitive 
measure of disposition to disclose would be 
subjects’ willingness to disclose to an ac- 
quaintance, a stranger, or the experimenter, 

The relationship between disclosure and 
the Marlowe-Crowne Social Desirability Scale 
(SDS) is an example of the problems asso- 
ciated with the use of the JSDQ. Doster and 
Strickland (1969) found no relationship be- 
tween the JSDQ and SDS. However, Bur- 
henne and Mirels (1970) noted that the 
social-desirability-approval motive should be 
related to self-disclosure since high-SDS 


101, P. Frankfurt. The role of some individual 
and interpersonal factors on the acquaintance proc- 
ess. Unpublished doctoral dissertation, American Uni- 
versity, 1965. 

H C. Н. Swensen. Self-disclo 
personality variables. Paper presented at the meeting 
of the American Psychological Association, San 
Francisco, September 1968 | 


sure as a function of 


subjects protect their vulnerable self-esteem 
by being conforming, submissive, conven- 
tional, and generally self-protective. Burhenne 
and Mirels rated the disclosure of subjects’ 
written self-disclosure and found a correlation 
of —.41 (p < .01) between disclosure and the 
SDS, while the JSDQ correlated only —.03 
with the SDS. 

It must be concluded that continued use 
of the JSDQ will only perpetuate the con- 
fusion that already exists in the literature. 
Alternate methods of measuring self-disclo- 
sure include written self-descriptions, and 
Jourard's (1969) previously discussed ques- 
tionnaire since there is evidence for the 
predictive validity of this measure. 

The author, however, feels that it would be 
a mistake to continue the collection of corre- 
lations between personality trait measures and 
self-disclosure questionnaires. Instead, self- 
disclosure should be measured behaviorally 
and used as the dependent variable, An exam- 
ple of this type of procedure is provided by а 
study by Axtell and Cole (1971), who clas- 
sified subjects as repressors, sensitizers, OT 
neutrals, and measured the amount of time 
subjects spent discussing either positive 07 
negative aspects of themselves. It should be 
expected that any study using such a pro- 
cedure would yield results which would be 
considerably less equivocal than studies em" 
ploying any of the poorly developed sel 
disclosure questionnaires, The procedure 
would also allow the introduction of A 
tional variables into the design, and a 
development of better questionnaires by 2 
ministering the measure to subjects 4 al 
correlating scores with subjects’ actu 
disclosure. 


SELF-Discrosung IN INTERPERSONAY 
RELATIONSHIPS 
Social Exchange and Penetration Process’ re 


. 1080 
In this Section, the role of self-disclo a. 


H n > e 

in interpersona] relationships is cons test 
Self-disclosure is placed within the E tio? 
of social 


exchange and social pene Theory 
theory, According to social penetration | ат, 
(Altman & Taylor, 1973; Taylor, Mater” 
& Sorrentino, 1969), “the growth of ап "be a 
persona] relationship is hypothesized t 


FT 
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joint result of interpersonal reward/cost Skypeck!? and Rivenbark (1971) report 


factors, personality characteristics, and situa- 
tional determinants [Taylor et al, 1969, 
p. 325]." Relationships proceed from non- 
intimate to intimate areas of exchange. The 
Tate and amount of movement from non- 
intimate to intimate areas of exchange 
(which includes verbal disclosure, type of 
activities engaged in, and nonverbal com- 
munication) is determined by reward/cost 
factors of past, present, and projected future 
exchanges, Social penetration theory empha- 
sizes Teward/cost factors, and is thus similar 
lo social exchange theory (e.g. Homans, 
1961; Thibaut & Kelly, 1959). The time 
dimension emphasized by social penetration 
theory is not crucial in social exchange for- 
mulations, and social penetration theory is 
unique in its emphasis on the joint effects 
of verbal and nonverbal communication in 
the acquaintance process. 


Reciprocity of Self-Disclosure 


Jourard (1959b) measured self-disclosure 
Within a group of nine female nursing college 
faculty members. The amount disclosed to a 
given colleague correlated highly with the 
amount of disclosure received from that col- 
league. Jourard and Landsman (1960) repli- 
cated the above finding in a group of nine 
male graduate students, Further evidence for 
the reciprocity effect was obtained by Jourard 
and Richman (1963) who correlated subjects’ 
reports of disclosure output and input (dis- 
closures received) for mother, father, best 
male friend, and best female friend. 

Other researchers have extended the in- 
vestigation of reciprocity to different relation- 
ships and types of subjects. Levinger and 
Senn (1967) studied reciprocity of disclosure 
of feelings in marital relationships. The cor- 
relation between reported output and reported 
input was ‚91 for husbands and .79 for wives. 
Levinger and Senn point out that (like the 
Jourard and Richman, 1963, study) the input 
and output scores were obtained from the 
Same person, and are probably artificially 
high, Correlations between husband output 
or input. and wife output or input were 
Considerably lower, averaging about .50. 


high correlations between reported output and 
input in samples of subjects ranging from 
6 to 18 years of age. 

Several experiments have confronted sub- 
jects with a confederate who discloses at 
either high or low levels. Regardless of 
whether disclosure was manipulated by vary- 
ing intimacy (Ehrlich & Graeven, 1971; 
Murdoch, Chenowith, & Rissman Wy, Uf 
number of statements disclosed (Chittick & 
Himelstein, 1967), these studies have found 
that the high-disclosing confederate elicits 
greater self-disclosure from subjects than the 
low-disclosing confederate, 

One potential criticism of these studies is 
that the subject may be imitating the re- 
sponse of the confederate rather than en- 
gaging in a reciprocation process. However, 
Tognoli (1969) did not allow subjects to 
imitate by eliminating the confederate's state- 
ment as an option for the subject's response, 
and still obtained evidence for reciprocity 
of disclosure, 

Levin and Gergen (1969) suggested that 
medium amounts of disclosure from another 
person indicate his desire for a closer rela- 
tionship and his trustfulness, yet another 
who communicates a great deal about himself 
may be seen to lack discretion and to be 
untrustworthy, and predicted a curvilinear 
relationship between other disclosure and 
subject disclosure, Subjects received a 40- 
item self-rating form from a partner on which 
either 4, 16, or 32 items were checked. 
Rather than finding a curvilinear relationship, 
subjects revealed a mean total of 10.5, 172, 
and 22.0 items in the low, moderate, and 
high groups, respectively, Levin and Gergen 
(1969) concluded that *the more information 
revealed by the other, the greater the abso- 
lute amount returned. On the other hand, in 
economic terms, the subjects reveal propor- 


1? G. Skypeck. Self-disclosure in children, ages six 
through twelve. Unpublished master's thesis, Uni- 
versity of Florida, 1967. 

7 P. Murdoch, R. Chenowith, & K. Rissman. 
Eligibility and intimacy effects on self-disclosure, 
Paper presented at the meeting of the Society of 
Experimental Social Psychology, Madison, Wisconsin, 
October 31-Nov. 1, 1960. 
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tionately less as the partner reveals more 
and more [p. 448]." 

Worthy, Gary, and Kahn (1969) suggested 
that reception of self-disclosing information 
from another is rewarding since reception of 
such information implies that one is trusted, 
and that more intimate information repre- 
sents greater reward for the receiver. Worthy 
et al. tested several predictions derived from 
social exchange theory, among them the prin- 
ciple that outcomes exchanged will be of 
comparable value. Female subjects were 
tested in groups of four. On each of 10 trials, 
each subject chose questions to answer and 
send to each of the three other subjects. The 
questions had been prescaled for intimacy 
value. Support was obtained for the reciproc- 
ity hypothesis: The mean levels of disclosure 
(on a 7-point scale) to the other who had 
disclosed least, next, and most were 2.997, 
3.316, and 4.041, respectively. 

Cozby (1972) attempted to extend the so- 
cial exchange theory analysis of Worthy et 
al. (1969). Cozby argued that, while increas- 
ing intimacy of disclosure may represent 
increasing reward, various costs become sa- 
lient at high intimacy levels, the most obvi- 
ous being anxiety over revealing information 
one would rather keep private, and having 
to interact with an individual who is behaving 
in a somewhat unusual manner. This analysis 
is supported by Argyle and Kendon's (1967) 
description of increasing intimacy in inter- 
personal relationships until halted by anxi- 
ety. Subjects were shown 10 topics which 
a hypothetical other had disclosed. These 
were low, medium, or high in intimacy value. 
They then chose 10 out of 70 items to dis- 
close to the other. It was predicted that 
reciprocity would determine subjects’ re- 
sponses to the low- and medium-disclosing 
other, but that intimacy level would not in- 
crease (or possibly decrease) as other dis- 
closure increased from medium to high inti- 
macy. The results represent an independent 
replication of the Levin and Gergen (1969) 
experiment: А small (but significant at the 
:05 level) increase was observed as other 
disclosure increased from medium to high 
intimacy. 

Reciprocity appears to become less power- 
ful as a determinant of subjects’ responses 
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at high levels of intimacy, but there is as yet 
no explanation for the counterintuitive re- 
sults obtained by Levin and Gergen, and 
Cozby. It is possible that making self- 
disclosures is rewarding for the person who is 
disclosing, and the laboratory setting is a 
particularly likely place for this to occur. 
In other words, the reward effect may tend 
to outweigh the anxiety discussed previously 
in a laboratory setting in which subjects do 
not see each other, and there is little likeli- 
hood of future interactions, The Murdoch 
et al. (see Footnote 13) experiment is rele- 
vant to this discussion. Murdoch et al. varied 
whether or not there was a possibility of 
future interaction with the confederate, Sub- 
jects disclosed more to the confederate with 
whom there was no possibility of future 
interactions, It is possible that the curvilinear 
effect would be obtained in a situation in 
which there was a possibility of future inter- 
action with the other, or possibly if subject 
anonymity was precluded by previous €n- 
counter between the subject and confederate. 


Self-Disclosure and Liking 


Jourard and Lasakow (1958) found that 
disclosure to mother and father correlated 
significantly with liking (r = .63 and 53; 
respectively). In a subsequent investigation, 
Jourard (1959b) had each of nine female 
nursing college faculty members order the 
other eight subjects in terms of liking, and 
found disclosure positively related to liking: 
Jourard and Landsman (1960) replicated the 
Jourard (1959b) procedure with a group 9 
nine male graduate students. The relation- 
ship between disclosure and liking was no 
obtained with this male sample. Jourard 4P 
Landsman (1960) concluded that their yer 
sults, along with the findings of Jouraré 
(1959b), corroborate the “statements abound- 
ing in the popular literature which portra 
men as strong and silent and distrustful 
feelings; women are commonly portrayed 
more emotional, talkative, overly trusting T 
those whom they like, and hence more vulne 
able to betrayal than men [р. 185]-” Alte! 
natively, it is possible that the male group 
was a much less cohesive group, with [o 
interaction experiences across the 36 relatio! 
ships than the nursing faculty group. 


as 
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Other questionnaire studies finding a rela- 
tionship between disclosure and liking in- 
clude Halverson and Shore (1969), Fitzgerald 
(1963), and Altman and Haythorn (1965). 

Frankfurt (see Footnote 10) had female 
subjects indicate what they would disclose 
to another girl described as being liked or 
disliked in a series of scenarios which repre- 
Sented successive stages in the relationship. 
More was disclosed to the liked than dis- 
liked girl, and the effect occurred only in 
intimate topic areas, 

Worthy et al. (1969) predicted that liking 
leads to disclosure to the other, and also 
that disclosure from another will lead to 
greater liking, Subjects, in groups of four, 
had a 10-minute “get acquainted” session, 
and then indicated their liking for the other 
subjects. In the actual experiment, each sub- 
ject chose items of information to disclose to 
each other subject on each of 10 trials. On 
the first trial, it was found that most was 
disclosed to the most liked other, and least 
was disclosed to the least liked other. In 
addition, analysis of covariance of final 
liking scores with initial liking covaried 
showed that subjects liked most those others 
who had disclosed most to them. 

Cozby (1972) suggested that the relation- 
ship between self-disclosure and liking is 
curvilinear. The high-disclosing other may 
arouse anxiety with his overly intimate dis- 
closures, and so the high discloser may be 
seen as providing the subject with negative 
outcomes. Strong support for the predicted 
curvilinear relationship was obtained in a 
situation in which subjects were responding 
to a hypothetical other, 

The Worthy et al. and Cozby studies used 
female subjects. A study by Ehrlich and 
Graeven (1971) used male subjects who con- 
versed for 16 minutes with a male confederate 
who disclosed at either low- or high-intimacy 
values, There were no differences in rated 
liking of the confederate in the two condi- 
tions, This finding may support Jourard and 
Landsman's (1960) suggestion that the rela- 
tionship between disclosure and liking may 
be different for males and females, However, 
interpretation of liking ratings is made dif- 
ficult by a heretofore unmentioned aspect of 
these disclosure studies—content. Tn all the 


studies except Cozby (1972) which used a 
role-playing procedure, subjects received from 
another content about a topic which varied 
in intimacy value. To illustrate, the topic, 
“favorite television program,” is low in inti- 
macy value. In an actual interaction, this 
topic becomes a statement such as “Му 
favorite television program is Mod Squad." 
The problem becomes obvious: Is the sub- 
ject’s liking rating due to the intimacy value 
of the topic or his own feelings about that 
particular program? The variables of topic 
intimacy and similarity-dissimilarity of the 
content should be combined factorially. From 
the Cozby (1972) study and the work of 
Byrne (1969), it might be expected that the 
most liked other would be one who disclosed 
topics of medium intimacy and expressed 
views that were similar to those of the subject. 

Pederson and Higbee (1969b) noted that 
the Jourard and Landsman (1960) findings 
might indicate that the characteristics of the 
target person that determine disclosure may 
vary from one target person to another. Peder- 
son and Higbee correlated disclosure scores 
for mother, father, best opposite-sex friend 
and best same-sex friend with ratings of these 
targets on 11 adjectives. For both males and 
females, liking was significantly related to 
disclosure only for father and opposite-sex 
friend. For each target, there were sex dif- 
ferences in the adjectives that were related 
to self-disclosure. Interestingly, for both 
sexes, few characteristics of the same sex 
friend were related to disclosure to the 
friend. 

Rubin (1970) has recently provided dis- 
tinctions between liking and loving relation- 
ships, Although Rubin’s (1970) love and 
liking scales have not been correlated with 
self-disclosure, there are indications that the 
distinction may prove useful in the study 
self-disclosure, For instance, Rubin 
found that females love ( 
same-sex friends more th 
provides a reason for the 
tend to disclose 


of 
(1970) 
but not like) their 
an males do, which 
finding that women 
to their same-sex friends more 
than males do (Jourard & Lasakow, 1958), 

The finding that disclosure to spouse is 
sreater than to any other target (Jourard & 
Lasakow, 1958) suggests that love results in 


greater disclosure than liking. Within. the 
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marital relationship itself, it might be ex- 
pected that self-disclosure would be related 
to marital satisfaction. However, empirical 
support for such a relationship is weak. Katz 
et al. (1963) report that, for males, marital 
satisfaction is related to greater disclosures 
of anxieties and worries, but not of other 
topics. No relationship was found for wives. 
Levinger and Senn (1967) found that, in 
general, disclosure tends to be related to 
satisfaction, but that the correlation is higher 
between satisfaction and disclosure of pleas- 
ant feelings, contrary to the expectation that 
a strong marital bond would be characterized 
by the ability to openly discuss unpleasant 
feelings. However, this finding would be ex- 
pected if we assume that the disclosure of 
pleasant feelings is equivalent to exchanging 
positive rewards. Shapiro and  Swensen 
(1969) found no relationship between marital 
satisfaction and disclosure, 


Social Approval 


In an experiment by Colson?! a confed- 
erate responded to subjects! disclosures with 
positive, neutral, or negative evaluations. 
Disclosure was greatest in the positive evalu- 
ation condition and lowest in the negative 
evaluation condition. 

Tn an experiment by Taylor et al. (1969), 
a confederate responded to subjects’ dis- 
closures in four 45-minute interaction sessions 
according to one of four reinforcement 
schedules: (a) continuous positive, (5) early 
negative, later positive, (c) continuous nega- 
tive, or (d) early positive, later negative. As 
predicted, subjects in the continuous and 
later positive reinforcement conditions dis- 
closed more about themselves, talked longer, 
and were generally more intimate. For dura- 
tion and amount of disclosure, the differences 
were primarily at intimate areas of exchange. 
No contrast effect for disclosure or liking 
of the confederate was found in the later 
positive and later negative groups. 


Dependency and Self-Disclosure 


Altman and Haythorn (1965) investigated 
the self-disclosure patterns of pairs of naval 


14W. N. Colson, Self-disclosure as a function of 


social approval. Unpublished master's thesis, Howard 
University, 1968 
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recruits who were isolated in a small room 
for 10 days (high mutual dependency). A 
control condition consisted of pairs who spent 
most of their waking hours together, but had 
access to the outside world (low mutual de- 
pendency). The high-dependency dyads were 
more intimate and showed a more active pat- 
tern of social interaction than did the low- 
dependency dyads. 

Thibaut and Kelly (1959) suggested that 
the formality and constraint which charac- 
terizes first encounters functions to prevent 
forming relationships that may prove un- 
satisfying in the future. A total lack of depen- 
dency, as with interaction with a stranger 
with whom there is little likelihood of future 


interaction, removes the constraints and 
causes increased disclosure. Experimental 


support for the “stranger on a train" phe- 
nomenon had been demonstrated by Murdoch 
et al. (see Footnote 13). Taylor et al. (1969) 
obtained no effect of a 3 week versus 6 month 
interaction expectancy, suggesting that greater 
disclosure to a "stranger" may occur only 
when the expected interaction period 15 
very brief. 

There is also evidence that individuals will 
resist entering into relationships in which the 
other person holds a great deal of power. 
Kounin, Polansky, Biddle, Coburn, and Fenn 
(1956) found that persons feel more at ease 
and reveal more negative things about them- 
selves with a nonpowerful counselor. How- 
ever, under some circumstances, people may 
disclose more to a person with high power 
than to a person with low power. Slobin, 
Miller, and Porter (1968) found that, while 
greatest disclosure is made to fellow workers 
in a business organization, there is greater 
disclosure to immediate superiors than to im^ 
mediate subordinates. Perhaps disclosure to 
superiors is an ingratiation technique. 


Disclosure as a Function of Personality of 
the Intcractants 


Social penetration theory asserts that 
amount of mutual disclosure in a dyad is * 
function of the particular personality charac 
teristics of the two individuals. The obvious 
prediction that pairs of high disclosers Wa 
disclose more to one another than pairs of 10% 
disclosers has been confirmed in sever? 


- 


Acl 


i 
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studies (Jourard & Resnick, 1970; Taylor, 
1968). Jourard and Resnick also found that 
when a low-disclosing subject is paired with 
a high discloser, the subject increases his 
disclosure output to match the level of the 
high discloser. 

Pairs of subjects in a study by Altman and 
Haythorn (1965) were either homogeneously 
high, heterogeneous, or homogeneously low 
on need achievement, need affiliation, need 
dominance, and dogmatism. Although there 
were no significant effects of group composi- 
tion on self-disclosure, perhaps due to small 
N, high need achievers disclosed more than 
low achievers, and low-dominance subjects 
disclosed more than high-dominance subjects 
in intimate topic areas while the opposite was 
true in nonintimate topic areas. 

Swensen and Nelson 15 studied disclosure in 
dyads matched for similarity and dissimilar- 
ity in extraversion, neuroticism, and atti- 
tudes, Disclosure was highest in male pairs 
who were similar on extraversion but different 
on neuroticism, in female pairs who were 
different on attitudes, and in male-female 
pairs who were similar on all three vari- 
ables. Persons and Marks (1970) found that 
greatest intimacy occurred in an interview 
Situation when both interviewer and inter- 
viewee has the same MMPI code type (4-2, 
4-8, or 4-9) than when they were paired in 
nonmatched combinations. 


Self-Disclosure over Time 


Taylor (1968) administered a self-disclo- 
sure questionnaire to male freshmen room- 
mates after they had known each other for 
1, 3, 6, 9, and 13 weeks. Half of the room- 
mate pairs were both high revealers, while 
the other half were both low revealers, At all 
points in time, the high-revealing dyads re. 
ported more mutual disclosure than did the 
low-revealing dyads, although the rate of the 
Increase over time was approximately the 
same for both groups. There was 
increase in nonintimate disclosures, 
Slow, eradual increase in intimate disc 
C TU Бо шы. SCA 
: °С. Н. Swensen, & D. A. Nelson. Interpersonal 
ede a function of personality and attitudes 
1 Presented at the meeting of the Midwestern 
Sychological Association, Chicago 1967. 


а rapid 
and a 
losures 


over time. Measures of liking showed no sig- 
nificant differences between the low- and high- 
revealing dyads. However, both groups 
showed a significant decrement in liking over 
time, and this trend was more pronounced 
among the high revealers, This decrement in 
liking with a concomitant increase in dis- 
closure may mean that "familiarity breeds 
contempt," but it may also indicate that most 
randomly chosen dyads are not compatible 
enough to be comfortable at prolonged high 
disclosure. 

Frankíurt (see Footnote 10) obtained evi- 
dence of increasing amount and intimacy of 
disclosure as subjects described what they 
would disclose to another person at succes- 
sive stages in their relationship. A large in- 
crease in number of intimate topics dis- 
cussed was found for subjects who liked the 
other person, while subjects who did not like 
the other person showed only a slight increase, 

In the Taylor et al. (1969) study, favor- 
able reward/cost groups showed a greater 
increase in duration of time talked over four 
45-minute interaction periods than did nega- 
tive reward/cost groups, and the effect oc- 
curred primarily in intimate areas of ех- 
change. Contrary to expectations, greatest 
intimacy occurred during the first period of 
interaction, although a gradual increase from 
second to fourth period was found for all 
groups combined. 

Methodological problems make research on 
the developmental aspects of interpersonal 
relationships especially difficult. Observation 
of behavior over a long period of time in the 
laboratory, as in the Taylor et al. (1969) 


Study, may provide insights into the de 


velop- 
mental process, but it seems inevitable that 


factors such as fatigue will affect the results 
Ап alternative approach might be to study 
people who have known each other for vary. 
ing lengths of time, For instance, Jourard 
(1961a) administered the JSDO to colier 
students ranging Y ' та 


: g in age from 17 to 55 
t was found that disclosure to parents de. 
creased with age, while disclosure to 0 


aser Pposite- 
Sex friend or spouse | 


increased Up to age 40 
after which a decrease was observed, А prob 
lem with the Jourard study 


is that not all 
. and the effect of 
confounded With age 


subjects were married 
length of marriage is 
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but the study does illustrate the type of 
approach that might be taken. 


DISCLOSURE IN THERAPY 
AND EXPERIMENTS 


While the research reviewed in this section 
could have been placed within the context 
of “interpersonal relationships,” the experi- 
menter-subject or therapist-client relation- 

3 ship would seem to merit special con- 
sideration. The therapist, interviewer, and 
experimenter, whether it be for the person's 
own benefit or for scientific reasons, seeks 
information about another person's private 
self, The studies reviewed in this section 
indicate that the effects of therapist or experi- 
menter disclosure may have a number of 
practical and methodological implications for 
therapy outcome and experimental results. 

Several authors (Jourard, 1964, 1968; 
Mowrer, 1964; Rogers, 1961) have written 
on the importance of full client disclosure for 
successful therapy. Truax and Carkhuff 
(1965) have reported significant correlations 
between therapist and patient disclosure. 
Further, level of patient disclosure appears to 
be a predictor of final case outcome (Truax 
& Carkhuff, 1965). 

Powell (1968) found that subjects dis- 
closed more when the interviewer responded 
to subjects’ self-references with open dis- 
closure than when he used approval-suppor- 
tive or reflection-restatement techniques. 
However, Vondracek (1969) reported greater 
amounts of disclosure when the interviewer 
used a "probing" technique than when he 
was “reflecting” or “revealing.” Tt is possible 
that "probing" is more effective than self- 
revelation in eliciting highly intimate infor- 
mation (Tuckman, 1966). 

Jourard and Jaffee (1970) found that in- 
creases in the duration of interviewer dis- 
closure prior to discussing a topic resulted 
in increases in duration of subject disclosure 
on the topic. 

Vondracek and Vondracek (1971) have 
shown that sixth-grade children disclose more 
to an adult interviewer who discloses than 
to an interviewer who does not disclose. There 
was also evidence that the increased dis- 
closure was in the same content area dis- 
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closed by the interviewer. Ehrlich and Graeven 
(1971) did not find evidence of topical 
reciprocity in their male college student 
sample. Future research should determine the 
conditions under which topical reciprocity 
will occur. 

The interviewer or experimenter who dis- 
closes, in addition to eliciting greater disclo- 
sure from subjects (i.e., inducing a reciproc- 
ity effect), is rated as more trustworthy 
(Drag, see Footnote 3) and more positively 
in general (Jourard & Friedman, 1970) than 
the experimenter who does not self-disclose. 

Objections to the use of therapist disclo- 
sure to increase patient disclosure have been 
raised by Vondracek and Vondracek (1971) 
and Polansky (1967) who states that the 
technique is clinically very sloppy. Although 
these authors do not elaborate on their objec- 
tions, perhaps they feel that disclosure to 
certain patients would have an adverse effect 
on the course of therapy, or that certain 
therapists would feel uncomfortable disclosing 
and communicate the discomfort to the 
patient, 

Disclosure may also affect the nature of 
subjects! responses in experiments, In general, 
studies which manipulate experimenter dis- 
closure involve a period of experimenter- 
subject interaction of approximately 30 min- 
utes in duration in which the experimenter 
and subject discuss their "personal back- 
grounds and interests, problems, and hopes 
and aspirations [Jourard, 1969, p. 113]. 
Control subjects have no prior contact with 
the experimenter. Subjects who receive €* 
perimenter disclosure have been shown tO 
change their Edwards Personal Preference 
Schedule responses (Jourard & Kormann; 
1968), give more revealing responses O” 
the Rotter Incomplete Sentence Blank 
(Heifitz'^), and require fewer trials to cri 
terion on a paired-associate learning task 
(Frey ?*), 


10 М. L. Heifitz. Experimenter effect upon pes 
ness of response to the Rotter Incomplete Senter 
Blank. Unpublished honors paper, University 
Florida, 1967. ial 

17 М. Frey. The effects of self-disclosure and suene 
reinforcement оп performance in paired-assoCi? v 
learning. Unpublished honors paper, Universit, 
Florida, 1968 
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Drag (1969) studies the effect of experi- 
menter participation in discussion groups 
composed of two, four, or eight subjects 
whose task was to disclose themselves on 
various topics. In groups in which the experi- 
menter did not disclose, Drag (1969) repli- 
cated the common finding that smaller groups 
provide the participants more opportunity to 
interact or express themselves (Thomas & 
Fink, 1963). When the experimenter joined 
in the discussion and self-disclosed, amount 
of subject disclosure was not affected in the 
two- or eight-person groups. However, in the 
four-person group, subject disclosure was 
similar to disclosure scores in the two-person 
group, and significantly greater than in the 
four-person-no-experimenter-disclosure group. 
This result was not expected, for it had been 
predicted that amount of disclosure would 
increase by equal amounts in all three groups 
when the experimenter disclosed. The finding 
would seem to indicate that in the eight- 
person group, experimenter disclosure (or 
probably disclosure by any group member) 
is not sufficient to overcome the inhibiting 
effects of the large group. 

Jourard (1969) has argued that disclosure 
by experimenters will result in greater hon- 
esty by subjects, and also prevent experi- 
menters from acting like “spies” and inhuman 
manipulators. An equity theory (Adams, 
1965) analysis would suggest that the experi- 
menter who asks subjects to fill out question- 
naires, yet who tells nothing of himself, may 
have created a situation of inequity. If this 
is actually the case, subjects may attempt to 
restore equity by lying, or being careless in 
their responses (see Weick, 1970). 

The equity theory analysis rests upon the 
assumption that subjects perceive inequity 1n 
the situation. It is possible, however, that the 
subject actually expects to disclose in the 
experimental situation, and that the oppor- 
tunity to disclose is a major reason for coming 
to the experiment (Hood & Back, 1971). 
Hood and Back found that, among males, 

for social psychological experi- 
jeep higher disclosers than nonvolun- 

з Among females, J SDQ а I 
lated to volunteering only " Ager x 
which seemed to indicate that subjects _ 


87 


would be required to make statements about 
themselves. 

Future research should determine the con- 
ditions under which experimenter or subject 
disclosure will affect experimental results, and 
the nature of the effects themselves. One 
possibility that deserves investigation is 
whether disclosure by the experimenter has 
the effect of making subjects more sensitive 
to demand characteristics (Orne, 1962) or 
experimenter biases (Rosenthal, 1966). If so, 
we can imagine subjects giving dishonest re- 
sponses in order to try to please the experi- 
menter and confirm his hypotheses. Future 
research in this area should also include con- 
trols for amount of time the experimenter 
spends with the subject (ie, a separate 
condition with very nonintimate disclosure). 


SrLr-DISCLOSURE AND OTHER 
“INTIMACY” VARIABLES 


The affiliative conflict theory (Argyle & 
Dean, 1965) asserts that an equilibrium for 
“intimacy” exists for any pair of individuals, 
and that the equilibrium point is a joint 
function of eye contact, physical proximity, 
amount of smiling, intimacy of topic (Xe; 
self-disclosure), and other unspecified vari- 
ables. The theory states that if the equilib- 
rium is disturbed along any one of its dimen- 
sions, attempts will be made to restore it by 
adjusting the others. A study by Exline, 
Gray, and Schutte (1965) found support for 
the theory in an experiment which related 
eye-contact to intimacy of interview material. 
Subjects spent significantly less time looking 
at the interviewer in an intimate interview 
condition than in a nonintimate interview. 

Jourard and Friedman (1970) examined 
the relationship between experimenter-subject 
“distance” and self-disclosure. In Experi- 
ment 1, females (but not males) increased 
duration of disclosure as distance increased, 
thus providing partial support for the theory. 
Distance in this study was varied by using a 
combination of eye contact and physical dis- 
tance. In Experiment 2, distance was varied 
by a combination of the experimenter touching 
and/or disclosing to the subject. Regardless 
of sex, subjects increased disclosure time as 
distance decreased. The authors concluded 
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that the Argyle and Dean equilibrium phe- 
nomenon will not be obtained in situations 
in which there is trust and positive feelings 
toward the person who comes close. However, 
the use of self-disclosure as both indepen- 
dent and dependent variable neglects the 
multivariate nature of the Argyle and Dean 
theory, and leads to the suggestion that the 
results of Jourard and Friedman's second 
experiment do little more than provide addi- 
tional confirmation of the reciprocity rule in 
self-disclosure. 

Regardless of the validity of the Argyle 
and Dean (1965) theory, it is obvious that 
people do engage in nonverbal communica- 
tion, vary their speaking distance, and use 
objects in the physical environment while 
communicating verbally, Altman and Taylor 
(1973) point out that research оп non- 
verbal, environmental, and verbal behaviors 
has proceeded in parallel fashion, but that 
no attempt has been made to integrate the 
behaviors into a rich description of inter- 
personal relationships. 


CONCLUSION 


Simmel (1964) writes that “obviously, all 
relations which people have to one another 
are based on their knowing something about 
one another [p. 307].” This statement seems 
So intuitively obvious that it is surprising 
that there is not more research or theoretical 
development in the area of self-disclosure, ТЕ 
also seems obvious that there are individual 
differences in self-disclosure, yet we know 
little about the meaning of these differences, 

Self-disclosing behavior may be seen as 
the product of two opposing forces, one 
operating to increase disclosure, the other 
Operating to inhibit disclosure. The first 
force is the one studied Most extensively by 
disclosure researchers, There are also factors 


which operate to inhibit disclosure. These 
might be termed discretion, or a ned for 
Privacy, and have been neglected in research 
on self-disclosure, Rather, self-disclosure has 
been closely associated with 1 x 
sroup movement which emphasizes the need 


to relate, communicate, and be hone: 
Open with others, 
been made 


the encounter 


st and 
Self-disclosure has thus 


a positive value, something to ре 


COZBY 


fostered. This orientation to self-disclosure 
has been expressed by Bennett (1967): 


The critical problem we face is not how to keep 
secrets from each other, but how to facilitate this 
readiness to communicate. The overriding question 
is how to maintain an atmosphere of trust and con- 
fidence which will enable us to talk about personal 
affairs as freely as we talk about automobiles; to 
share experience as we share the weather [p. 375] 


What happens to privacy when we share 
our intimate thoughts as easily as we talk 
about the weather? Bennett classifies people 
who would withhold information as hermits, 
prudes, paranoids, or rascals. Yet Altman 
and Taylor (1973) have expressed fear 
of a "tyranny of openness" in which indi- 
viduals have no freedom to have private 
thoughts because full disclosure is demanded 
by others in all social encounters. 

Simmel (1964) has written extensively on 
the importance of discretion, and suggests 
that “an ideal sphere lies around every human 
being . . . this sphere cannot be penetrated, 
unless the personality value of the individual 
is thereby destroyed [p. 321].” Thus, selí- 
concealment (not lying or misrepre: 
one’s self, but maintaining 
the self) may have functional significance in 
giving a person a sense of “individuality.” 
Simmel (1964) also suggests that some mari- 
tal difficulties are the result of too much 
self-disclosure: “They lapse into a matter-of- 
factness which has no longer any room for 
surprises [p. 329].” Thus, discretion may 
important in preventing boredom from occu 
ring in interpersonal relationships. 

It is hoped that the peculiar value ori- 


entation that has accompanied much of the 
written discussion of s 


dispensed with, 
that the rese 
know 


senting 
a private area of 


be 
r- 


elf-disclosure can be 
It seems far more important 
arch be conducted. We should 
more about the relationship between 
verbal and nonverbal components of messages 
(Duncan, 1969), We can ask about the 
nature of self-disclosure when people with- 
draw from a relationship (Altman & Taylor, 
1973). What is the function of privacy in 
interpersonal relationships? What are the 
Consequences of unrestrained self-disclosure? 
So much remains to be learned about this 


important aspect 


of interpersonal relation- 
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ships that value-laden speculations that close 
off areas of investigation can only be seen 
as a hinderance to our understanding of 
self-disclosure. 
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ALCOHOL Usk 

By the time t 

high school, 
established 

drinking 


he adolescent graduates from 
it is quite likely that he has 
à regular (light to moderate) 


5 pattern, The Proportion of adoles- 
cent drinkers varies from community to com- 


munity—anywhere from 30%-40% to 60%— 
8076 of the adolescent population, This 
Variation reflects regional and ethnic sub- 
cultural differences (Maddox, 1970). While 
the vast Majority of adolescents are either 
abstainers Or light to moderate drinkers, 
Somewhere between 2%-6°% of the adolescent 
Population are problem | drinkers—drinkers 
for whom alcohol has come to occupy a more 
than Simply incidental role in their lives. 
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The consensus of a large number of em- 
pirical studies reviewed by Maddox (1970) 
was that the majority of adolescent drinkers 
report that their first drinking experience was 
in their own home with parents and/or 
relatives present, that they see drinking as an 
adult behavior, and that alcoholic beverages 
are typically readily available around the 
house, Jessor, Graves, Hanson, & Jessor 
(1968) found that unsanctioned introductions 
to drinking and heavy consumption patterns 
among high school age children were related 
to the heavy drinking behavior of adult Ute 
males. In his clinical observations of adoles- 
cent problem drinkers, McKay (1961) noted 
that almost all the fathers and some of 
the mothers in his sample were alcoholics. 
Gusfield (1970) found that within a college 
student sample, high users of alcoholic bever- 
ages were more likely than low users to have 
parents who were also high users of alcohol. 
Summarizing the research in this area, 
Maddox (1970) concluded that knowledge of 
parents’ drinking patterns is the single most 
accurate tool for predicting adolescent drink- 
ing behavior. Unfortunately, however, there is 
a lack of systematic investigation into indi- 
vidual difference variables which could explain 
the substantial number of false positives in 
the research reviewed above (e.g., personal 
feelings of shame over heavy parental drink- 
ing could explain why a given adolescent is 
an abstainer, even though his parents are 
alcoholics. 

More recently, Jessor, Collins, and Jessor 
(1972) found that the opportunity to learn 
and to be reinforced for drinking, especially 
by peers, plays a prime role in the prediction 
of abstainer to drinker status shiíts among 
junior-senior high school students. Studies 
with college student peer groups, specifically 
fraternities, have supported the notion that 
peer influences on adolescent attitudes toward 
drinking and drinking behavior are substan- 
tial. Gusfield (1970) and Rogers (1970) both 
found that fraternity members are more likely 
to be drinkers than nonfraternity members. 
Gusfield went one step further by showing 
that fraternity members with high-user par- 
ents (who would, therefore, be expected to 
be high users) drank no more frequently ог 
excessively than fraternity members with 


low-user parents. Although the drinking oc- 
curring in the context of another kind of peer 
group, adolescent gangs, may at times be an 
expression of rebellion and hostility, Maddox 
(1970) has found that this is not typically 
the case. In the absence of data on the proc- 
ess of influence, findings such as those above 
are left equivocal. Uncontrolled selective 
factors (both self and group determined) that 
affect fraternity or gang membership may 
play roles which are as significant or more 
significant than group membership itself, 

The evidence on the relation between reli- 
gious affiliation and drinking behavior is 
fairly conclusive, Maddox (1970) points out 
that, due to the norms concerning drinking 
in the respective religions, the average drinker 
will probably be a Jew or a Catholic rather 
than a Protestant. Snyder (1958) found, 
however, using a sample of college alcohol 
users, that the intoxication rates for ascetic 
Protestant and Mormon students is much 
higher than that of Jewish students. Simi- 
larly, Skolnick (1958) found that the rate 
of social complications due to drinking for 
Jewish students who drink is much lower than 
the rate of social complications for student 
drinkers who are members of ascetic Protes- 
tant groups. Finally, Gusfield (1970) found 
that Protestant college students were more 
often high users of alcohol than were Jewish 
students. Thus, while fewer Protestant and 
Mormon than Jewish students drink, those 
Protestant and Mormon students who drink 
are more likely to be problem drinkers than 
the Jewish students who drink. 

Mizruchi and Perrucci (1970) explain 
these findings by using the concepts of pre- 
scriptive and proscriptive norms. The norms 
in the Jewish religion concerning drinking 
behavior are prescriptive: there is an elabo- 
rate system of explicit directives as to what, 
when, where, with whom, how much, and 
why one is expected to consume alcoholic 
beverages (including, we may add, occasions 
when drinking to moderate excess is socially 
tolerable, ог even approved). The norms of 
the Protestant (especially ascetic Protestant) 
religion are proscriptive in nature: Drinking 
is forbidden. Thus, when a Jew decides to 
use alcoholic beverages, norms advocating 
moderation exert a controlling influence on 
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his drinking behavior. However, when persons 
from a Mormon or Protestant background 
use alcoholic beverages, there is an absence 
of norms to guide their use, and the drinking 
is more likely to be uncontrolled. The issue 
is not unrelated to the possible positive and 
negative roles of the parental model since 
parents are, after all, the principal early-life 
transmitters of societal norms. 

Two other important points concerning 
religion's influence upon drinking behavior 
are made by Maddox (1970): (a) Close 
association with any organized religion will 
depress the rate of problem-related drinking, 
as there is no (major) religion which advo- 
cates the intemperate use of alcohol; (b) 
“. . . there is no indication that religious 
organizations as socializing agencies can or 
do intervene effectively to increase the preva- 
lence of abstinence among persons whose 
interpersonal relationships otherwise support 
drinking [p. 116].” 

In summary, the research literature con- 
cerning the sociocultural correlates of ado- 
lescent problem drinking focuses on three 
interrelated variables: parental models, peer 
models and sanctions, and religious affiliation. 
The most common variety of adolescent 
problem drinker would seem to come from a 
home where the parents are high users of 
alcohol and would seem to belong to peer 
groups in which high alcohol use is also 
encouraged. Furthermore, such individuals 
likely have rather loose religious affiliations 
with one of the abstinence-oriented religions. 
Unfortunately, however, there has been little 
reported research into the relationship of 
adolescent problem drinking to such basic 
sociocultural variables as a race, socio- 
economic status, and living conditions. 


PERSONALITY CORRELATES OF ALCOHOL USE 


Within the past 10 years there have been 
many studies which have addressed them- 
selves to the question of Whether or not there 
exists a pattern of personality traits common 
to most adolescent problem drinkers. The 
most comprehensive study was conducted by 
Williams (1970), who used the Adjective 
Check List as his primary assessment instru- 
ment. His results characterize the college- 
age, male problem drinker as a comparatively 


independent, aggressive, anxious, impulsive 
and depressed individual, who deemphasizes 
both primary and secondary relationships, is 
relatively unconcerned about others and does 
not think too highly of himself, In an earlier 
study, using the same measuring instruments 
an an all male college population, Williams 

(1968) found a significant positive correlation 
between the problem drinking and anxiety. 

Williams’ (1970) findings, that problem 
drinkers are relatively impulsive, aggressive, 
and depressed, have also been noted by 
Mackay. MacKay (1961) reported many 
incidents of “impulsive angry outbursts” on 
the part of juvenile problem drinkers and 
their parents, and cited evidence of depres- 
sion in these adolescents ranging from “feel- 
ings of irritability, worthlessness, fear of the 
future . . . poor appetite, strange eating 
habits, and sleeplessness to overt suicide 
attempts [p. 26]." 

The findings of Jones (1968, 1971) are 
very similar to those just reviewed. Jones’ 
work is unique in two respects: It is longi- 
tudinal in nature, and she uses both male and 
female subjects. In her longitudinal study of 
problem drinkers, utilizing preproblem data, 
Jones found both her male and female pre- 
problem subjects to be more unstable, un- 
predictable and impulsive than other pre- 
moderate drinkers or abstainers. At the same 
time, she found that females who are des- 
tined to be problem drinkers are more unlike 
female prenormal drinkers than are male pre- 
problem drinkers when compared to male 
prenormal drinkers. 

Jessor, Carman, and Grossman (1968) 
found that individuals in a college environ- 
ment who have low expectations of attaining 
academic and social recognition will tend to 
drink more often, get drunk more often, and 
get into trouble over drinking more often 
than will those individuals who have higher 
expectations in these two areas. 

Findings in the area of sexual identity are 
quite confusing. Parker (1969) found that 
a sex-role conflict exists for male college 
students with a strong alcohol disposition. 
They want to be perceived as very masculine, 
but at the same time they themselves do not 
identify with masculine traits. A study by 
Zucker (1968) concluded that adolescent 
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problem drinkers are not experiencing a sex- 
role conflict, but are only attempting to pre- 
sent to the world a sex-role facade—high 
school problem drinkers only want to seem 
more masculine than do the moderate to 
light drinkers. The disparity in Parker and 
Zucker’s results may be due to the use of 
different samples (college, high school), or to 
ambiguities in the concept of sexual identity 
and the derivative inconsistencies in the 
criteria, 

While the above research suggests a num- 
ber of preproblem use correlates of problem 
alcohol use, why, given these person states, 
does the youth turn to alcohol use? Only 
Jessor, Carman, and Grossman (1968) and 
Williams (1966, 1968) consider this last 
question to any significant extent. essor, 
Carman, and Grossman hypothesize that 
individuals in a college setting will become 
highly involved with alcohol when their 
expectations of attaining two socially and 
presumably personally valued goals are low. 
In this context, heavy alcohol consumption 
is seen as an alternative way of attaining 
these previously unattainable goals, a way of 
attaining substitute goals, or a way of coping 
with actual or anticipated failure in regard 
to goal attainment. Јеѕѕог, Carman, and 
Grossman showed that a relationship does 
exist between low expectations of need satis- 
faction (a need here is viewed as an inter- 
nalized goal) and high alcohol involvement. 

It is a widely held belief that many ado- 
lescent problem drinkers drink and get drunk 
often in order to escape from their high 
levels of anxiety and depression. Logon 
(1966) tested this idea and found RUN 
alcohol does afford problem drinkers will 
some relief when they are moderately intoxi- 
cated, but not when they are severely in- 
toxicated. While the results of his study can 
explain why an anxious and depressed person 
might drink moderately and frequently, they 
do not help to explain why an anxious and 
depressed individual would become a problem 
drinker (associated with both excessive and 
frequent drinking) since relief from anxiety 
and depression is apparently not gained from 
excessive alcohol consumption. : > 

Using the personality variables which dif- 
ferentiated problem drinkers from non- 


problem drinkers in his 1970 study, Williams 
(1968) studied personality changes occurring 
aS a person consumes more and more alcohol. 
Using a college male sample, Williams found 
that the personality descriptions of non- 
problem drinkers under the influence of 
alcohol closely matched the personality of 
problems drinkers when sober, Williams con- 
cludes that a problem drinker feels more at 
home in a drinking situation than a non- 
problem drinker because he is around people 
who are now aggressive, self-centered, impul- 
sive and the like, just as he is. This would 
explain why an adolescent male with this 
given set of personality traits might tend to 
gravitate toward situations where alcohol is 
being served and tend to drink frequently, 
but it does not explain why he will drink 
excessively. 

McClelland’s (1971) study attempted to 
answer the question, “What are the psycho- 
logical effects of alcohol?” McClelland found 
that as male students at fraternity parties 
drank more, there was a corresponding in- 
crease in power thoughts (as indicated on 
a Thematic Apperception Test type mea- 
sure—thoughts of having impact on others, 
thoughts of aggressions, of sexual conquest 
and of being strong, big, and influential, 
After moderate drinking these power thoughts 
centered on the altruistic use of power on 
behalf of others, while after heavy drinking 
the power thoughts concerned personal domi- 
nance over others. McClelland concluded 
that the problem drinker is an individual who 
is intensely concerned with his own personal 
power and who has chosen drinking as a way 
to accentuate his feelings of power. 

In summary, there appears to be sub- 
stantial agreement that adolescent problem 
drinkers are individuals lacking in personal 
controls, as evidenced by relatively high ag- 
gressiveness and impulsiveness, At the same 
time, there exists some evidence which indi- 
cates that adolescent problem drinkers have 
some basic neurotic tendencies which form a 
coherent cluster of traits: relatively low 
self-esteem, high anxiety, depression, and a 
general lack of success in the attainment of 
life goals. 

There are some basic areas of disagreement 
concerning the existence of two basic con- 
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flicts within the personality of the adolescent 
problem drinker: independence and sex-role 
conflicts. A likely contributor to this problem 
is the use of different measures of criteria 
(e.g., sex-role conflict; Parker, 1969; Zucker, 
1968). Tied to this issue is the questionable 
rationale for criterion choices made by many 
of the researchers. Although the indepen- 
dence-dependence trait, for example, may be 
correlated with adolescent problem drinking, 
there is no compelling theoretical rationale 
for the choice of this trait as a prime inde- 
pendent variable. The Jessor, Carman, and 
Grossman (1968) study is the exception to 
this rule as it is made conceptually clear, 
prior to any empirical research, why alcohol 
use as a coping behavior is related to low 
expectation of goal attainment. Furthermore, 
most research on alcohol consumption among 
adolescents has been limited to males of 
college age. It is not at all certain that these 
results can be generalized to college fe- 


males, or to males or females who are under 
college age. 


SOCIOCULTURAL CORRELATES OF ADOLESCENT 
Nancoric Usr 


Much research has focused on the familial 
environment of the adolescent addict. Family 
fragmentation, in the form of either the 


; and by Lewis 
Who noted that the family 
ally lacks cohesiveness, 

51 many researchers agree that the 
Parents of addicts are e Е 


types (Modlin & Montes, 196 ality 
marked lack of › 1964), there is a 


(1961), Bender (1963), and Laskowitz 
е second pattern is that of the 


underdominating Parent (Ausubel, 1961; 


Bender, 1963). This personality pattern is 
supported by Rosenberg’s study (1968) which 
discovered a relationship between inadequate 
parental control and adolescent addiction and 
the research of Robins (1966), which found 
that subjects who were diagnosed as psycho- 
paths having problems with alcohol and drugs 
were early in the sibling birth order and fre- 
quently came from families whose parents 
were uninterested in them and exercised little 
discipline. While Rosenberg (1969) has iso- 
lated the trait of underdominance as typical 
of the mother of the adolescent addict, Lewis 
and Osberg (1958) hypothesize that the 
father, if not absent, is the weak individual 
in the family, 

The third parental personality pattern 
thought to be related to adolescent addiction 
is overdomination (Ausubel, 1961; Bender, 
1963). Zimmering (1952) and Lewis and 
Osberg (1958) feel that the pattern consists 
specifically of an overdominating mother, 
while Mason (1958) posits that these over- 
powering mothers simulate aggression and 
sexual drives in their children, then deny and 
punish these drives, Rosenberg (1969) found 
that the father of the adolescent addict repre- 
sented the overdominating influence in the 
family, frequently the symbol of punishment 
and rejection. 

Another theme in the research on socio- 
cultural correlates of adolescent addiction is 
parental rejection during the formative years, 
While Rosenberg (1969) points to frequent 
paternal rejection, Hirsh’s (1961) theory sug- 
gests that sons who are infantilized by 
rejecting narcissistic mothers are prone to 
addiction. Gold (1957) also suggests the 
salience of a rejecting. environment within 
families, producing insecurity feelings. Rosen- 
berg (1968, 1969) concludes that the early 
life of the addict is marked by either a 
general deprivation of parental models or the 
Presence of inadequate models, as indicated 
by a high degree of parental alcohol abuse 
or mental or physical illness. Nyswander 
(1956) disagrees to some extent with the 
deviant family model by stating that addicts 
have no more or less problems with their 
tet patent Aeg ry, d 
insere ue = t ough, there is a lack of 

9 specifics in the studies noted 
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above, this body of research (primarily 
retrospective studies) seems to point to the 
hypothesis that the adolescent addict, in his 
early years, suffers from mentally disturbed 
parents, from rejecting parents, or from the 
absence of one or both parents. 

The ethnic composition of adolescent nar- 
cotic addicts has been studied by Chein 
(1959), Bender (1963), and Vaillant 
(1966a), who agree that most addicts are 
members of a minority community. There 
scem to exist two implications of this finding. 
First, being a member of an ethnic minority 
leads to the possibility of racial discrimina- 
tion; both Murray (1967) and Zimmering 
(1952) found racial discrimination to bea 
relevant factor in narcotics addiction. Second, 
Vaillant’s (1966a) sample of adolescent ad- 
dicts were not only from a minority com- 
munity but were first generation, native born 
Americans. This finding points to the pos- 
sibility that one of the sociocultural factors 
relevant to narcotics addition is a cultural 
clash between parent and child. ©. 

Chein (1959), Bender (1963), and Vaillant 
(1966b) found that narcotic addicts are gen- 
erally found in urban and metropolitan areas, 
These addicts usually grow up in poor resi- 
dential areas (Murray, 1967) and come from 
families in low-income groups (Bender, 1963; 
Chein, 1959; Vaillant, 1966b). 

The hypothesis that there is some parent- 
child conflict regarding status achievement is 
supported by the studies of Rosenberg (1968, 
1969), Vaillant (1966b), and Laskowitz 
(1961) who found an inadequate vocational 
adjustment history in the backgrounds of 
their addict sample. The studies of Ausubel 
(1961), Scher (1966), Gamso and Mason 
(1958), Cameron (1963), and Modlin and 
Montes (1964) also indicate that it is in 
these urban, poor residential areas that there 
is a relatively high availability of narcotics, 
and it is this availability which also helps 
to determine who becomes a drug addict. 

A number of studies (Cameron, 1963; 

1958; Modlin & Montes, 

Gamso & Mason, m P. Tave 
1964; Scher, 1966; Vaillant, 1966b; ha 

; that adolescent narcotics addiction 

е connected with peer-group entice- 

ж 3 deviance. Furthermore, Little and 


Pearson (1966) attribute perpetuation of 


drug addiction to the interdependence of an 
addict and others whom the addict persuades 
to experiment with narcotics. 

In summary, research indicates that the 
adolescent narcotic user is frequently a mem- 
bers of an ethnic minority, often from an 
impoverished urban environment, often from 
a broken home. There is further consensus 
regarding the importance of peer group en- 
ticement and availability of narcotics. While 
there seems to be agreement on the general 
point that adolescent users are usually subject 
to deficient parental models, there is marked 
disagreement as to the nature of this de- 
ficiency; one or both of the parents are 
characterized as being either overprotecting, 
overdominating, underdominating, or reject- 
ing. This lack of consensus can be attributed 
to the heavy reliance on retrospective studies 
of the narcotic users! parents, 


PERSONALITY FACTORS RELATED TO 
Narcotic Use 


Intelligence has been a variable considered 
by several researchers. Zimmering (1952) 
found that most addicts to be of dull normal 
intelligence, but Rosenberg (1968), Vaillant 
(1966b), and Knight and Prout (1951) dis- 
agree. They found their subjects to range 
from average to superior intelligence. Im- 
maturity is considered a personality charac- 
teristic of youthful addicts by Rettig and 
Pasamanick (1964), Ausubel (1961), and 
Bender (1963). Both Bender (1963) and 
Zimmering (1952), even though their observa- 
tions were conducted a decade apart, agree 
that addicts show low frustration tolerance, 
that they tend to repress their troubles, and 
that they tend to withdraw into fantasies. 
Knight and Prout (1951) described their 
clinic population of addicts as being into- 
verted, shy, insecure (also Gilbert & 
Lombardi, 1967), having a flat affect, shallow 
interests and immature goals, and lacking 
healthy resources and adequate structure. 
Gilbert and Lombardi (1967), using Minne- 
sota Multiphasic Personality Inventory data, 
concluded that addicts generally are depressed 
(also Leeds, 1965), tense, feel inadequate 
(also Laskowitz, 1961), are irresponsible 
(also Laskowitz, 1961), are impatient and 
irritable, lack persistence, are hypersensitive 
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apprehensive, and have poor morale. 
pet ‘Gilbert and Lombardi (1967) and 
Laskowitz (1961) noted the egocentrism of 
addicts in their disregard for social mores 
and in their view of themselves as members 
of an elite group. Restlessness and quick 
dissatisfaction, private logic following, limited 
problem solving, and overvaluation of the 
masculine role are described by Laskowitz 
(1961). Savitt (1963) describes the addict 
as à person unable to experience love and 
gratification through the usual channels. An 
interesting Study by Cooper (1959) suggests 
that the personality descriptions of drug ad- 
dicts in literature are not at all like the 
self-perceptions of these individuals, 

Most researchers agree that most narcotic 
addicts have some mental problems or a weak 
or disturbed personality (Ausubel, 1961; 
Chein, Gerard, Lee, & Rosenfeld, 1964; 
Gerard & Kornetsky, 1954: Holmberg & 
Jansson, 1968; 1961; Savitt, 


1963; Isbell, & Chapman, 1948; 


) and Bender (1963) 
disorder, 


to be 
Y Gilbert and Lom. 
57) 


many of the classical psychoneurotic and 
psychotic features [p. 97]." Rosenberg 
(1969) reported less disturbance on psycho- 
logical tests by addicts than alcoholics, al- 
though Gilbert and Lombardi (1967) found 
addicts to have deeper and more widespread 
pathology than nonaddicts, Holmberg and 
Jansson (1968) consider schizoid curiosity 
to be a factor in narcotic use, Hill (1957) 
suggests that some addicts are schizoid psy- 
chopaths, and Bender (1963) reports evi- 
dences of schizoid blocking of the perceptual 
field in addict samples, Clark (1962) reports 
that psychiatric illnesses precede or accom- 
pany 40% of drug addiction cases and that 
20% of the cases appear to be sociopathic, 
Nyswander (1956) has found that the re- 
Ported incidence of mental illness among ad- 
dicts is no different from the rate in the 
general population, and she suggests that this 
is the result of addicts’ failure to seek or 
avail themselves of help, and because their 
Psychopathology is often subclinical, 

In summary, addicts have been found to be 
immature, insecure, irresponsible, and ego- 
centric, However, the way in which all these 
traits fit together within any given narcotic 
addict is unclear, This can be seen as the 
result of a lack of a sound theoretical struc- 
ture as the point of origin for these studies, 
Furthermore, although there is some agree- 
ment that the narcotic addict suffers from a 
personality disorder, there is hardly any 


agreement as to the Specific dynamics of this 
disorder, Indeed, 


the population of ada 
very loosely defined 
one personality type. P 


› different patter use (e.g 
s & 


rentia] involve. 
? Psychologica] functions. of 
5 last point, see Chein et a]. 
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the psychopathology orientation is appropri- 
ate to the primary prevention of narcotic 
use. In addition, the use of uncontrolled con- 
vict and clinic populations in this body of 
research casts considerable doubt on the va- 
lidity, reliability, and representativeness of 
the research findings. Furthermore, the per- 
sonality results reported are primarily the 


product of small sample observational 
research, or in many cases, conjecture. 


SOCIOCULTURAL CORRELATES OF ADOLESCENT 
Psycuepetic Druc USE 

Most of the research into psychedelic drugs 
has been devoted to studies of marijuana, 
LSD, and methamphetamine—three distinct 
drugs that will be subsumed here under the 
psychedelic label. 

With respect to the gross group definers 
of sex and age, Pearlman (1968) found no 
difference between psychedelic users and non- 
users in his college population, Harris (1971), 
however, found males more likely to use 
marijuana than females. Hager, Verner, and 
Stewart (1971) found males reporting more 
experience with all types of drugs, although 
both sexes showed similar patterns of con- 
sumption. Mitchell, Kirkby, and Mitchell 
(1971), in a study done with college fresh- 
men, found different use patterns between 
males and females, Hager et al. (1971) found 
a pattern of drug use among adolescents in 
which there was a definite increase in use 
with age. As usual, contradictory evidence 
is available—Mizner, Barter, and Werme 
(1970) found that younger college-age stu- 
dents fell into moderate or heavy use cate- 
gories while older students, even though they 
presumably have had more opportunity for 
being acquainted with drugs, have a more 
experimental or casual use pattern. Their 
sample may have been biased, however, as 
heavy users may have dropped out of college 
and thus may have been underrepresented. 

A number of studies have indicated that 
the social class of psychedelic users is gen- 
erally middle to upper middle class in sig- 
nificant contrast to narcotic addicts (Carey 
& Mandel, 1968; Gillie, 1969; Hager et al., 


1971; Harris, 1971; Kuehn, 1970; Smart 
& Fejer, 1969; Steffenhagen, McAree, & 


Zhevtlin, 1969; Suchman, 1968; Welpton, 
1968). This social class distinction is con- 
sistent for marijuana users as well as users 
of the more powerful psychedelic drugs, that 
is, LSD and methamphetamines. Users of 
LSD commonly have fathers who are either 
professional or managerial and have parents 
with high levels of education (Smart & Fejer, 
1969), 

Although McGlothlin and Cohen (1965) 
feel that atheists, agnostics, and believers in 
pantheistic religions would more likely be 


users than Protestants, Catholics, or Jews, 
Steffenhagen et al. (1969) found 70% of 
their volunteer college sample to be Jewish. 
Finally, Dearden and Jekel (1971) found 
church attendance in a high school sample 
to be inversely associated with drug use. 
Steffenhagen et al. (1969) reported the 
families of marijuana users as not being dis- 
organized and disrupted. In contrast, Harris 
(1971) found marijuana users often had early 
childhood rearing with other than both par- 
ents, while Gillie (1969) reports that the 
percentage of parental loss is higher for 
psychedelic users than for nonusers. Through 
observation, Kuehn (1970) found families of 
psychedelic drug users to be characterized 
by excessive permissiveness, material grati- 
fication with inadequate or inappropriate 
limit-setting, parents with histories of severe 
emotional instability, family conflict sup- 
pressed behind a facade, communication 
binds, parental love more talked about than 
provided, parental role reversals, and a lack 
of tolerance for anger. Welpton's (1968) 
Study of LSD users suggests that the par- 
ents of users have failed in their efforts 
to have an intimate relationship with one 
another. Keniston (1965) described mothers 
of psychedelic users as talented women who 
“gave it all up for marriage" and described 
the fathers as either having failed in life or 
"sold out" for success. An important problem 
with most of these kinds of studies lies in 
their postdictive and retrospective qualities, 
A close knit group which is highly involved 
in psychedelic drug use has been noted as 
being a critical aspect of the peer Subculture 
by a number of ( Freedman, 
1968; Klein & Phillips, 1968: Ludwis & 


researchers 
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PERSONALITY CORRELATES ОЕ 
Psvempyy i. Usr 


Edwards, Bloom, and Cohen ( 1969) foung 
considerable hostility among heavy drug 
users, perhaps reflecting д predisposing state, 
as amount of hostility w. 
degree of drug dependence, Cock 
Marks (1969) iound higher total hostility 
scores among amphetamine users in a juvenile 
delinquent sample, Other personality charac. 
teristics reported by these authors were that 
intropunitive (self-criticism 
and guilt), have higher anxiety, exhibit 
lower extraversion, are more shy and retiring, 
less self-confident, and more radical 
traditional, 


Kuehn (1970) observed 41 coll 
and suggested the 


than 


ege students 
following eight traits for 
student drug users: (a) tendency to live ex. 
cessively in the Present; (b) an 

Passive and reactive Position in inte 
relationships: (c) serious cog 
ties (schizophrenoid disturb 
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sion: (e) study diff; 


excessive 
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to frustration, 
"Тоир dependent 
The same study 


reckless, 
» and less Seli-controlleq. 
also found two distinct types 
of marijuana users, distinguished on the 


basis of academic performances. The high- 
performance user w; 


as found to be intelligent, 
alert, confident 


; and sensitive, while the low- 
Ser was depicted as being shy, 
Cynica], unstable, and less able to handle 
abstract thinking. 

Hogan, Mankin, Conway, and Fox (1970) 
collected California Personality Inventory 
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data from college undergraduate marijuana 
users and nonusers. The users were found to 
be socially poised, open to experience, con- 
cerned with feelings of others, self-confident, 
skilled in interpersonal relations, socially per- 
ceptive, to possess a wide range of interests, 
and to have the achievement motivation nec- 
essary for success in graduate school. Mari- 
juana users were also found to be impulsive, 
pleasure-seeking, rebellious, tending toward 
narcissism and self-aggrandizement, overly 
concerned with personal pleasure and external 
diversion, hostile to roles and conventions, 
somewhat irresponsible, and rather noncon- 
forming. Nonusers appeared to be responsible, 
rule abiding, pleasant, considerate and duti- 
ful, but they also tended to be conventional, 
lacking in spontaneity, flexibility, and verve, 
perhaps too deferential to external authority, 
narrow in their interests and overcontrolled. 

In the Hogan et al. (1970) study, users 
were found to major mostly in the humanities 
and social sciences, and nonusers exhibited 
slightly poorer grades than users. Results of 
other studies have characterized psychedelic 
users as being underachievers (Smart & 
Fejer, 1969), as being indistinguishable from 
nonusers (Pearlman, 1968), and as being 
slightly above average academically (Steffen- 
hagen et al., 1969). In a population of youth- 
ful offenders using amphetamines, Cockett 
and Marks (1969) found no association be- 
tween amphetamine-taking and intelligence 
level. Cohen and Klein (1970) found that 
users in а hospital population were of higher 
intelligence than nondrug users. One uncon- 
trolled variable that could account for this is 
that persons of higher intelligence may be 
more apt to seek out unusual (ie., psyche- 
delic) experiences or that they may be more 
apt to seek hospitalization. 

In an attempt to determine the psycho- 
pathological correlates of psychedelic use, 
McAree, Steffenhagen, and Zheutlin (1969) 
found that marijuana-only users were not 
significantly different from controls оп Min- 
nesota Multiphasic Personality Inventory 
data (in some contrast to Hogan et al., 
1970), Multiple-drug users, however, showed 
high ranking on the schizophrenia scale 
(representing withdrawal, poor interpersonal 
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relationships, aloofness and the inability to 
express emotions). They also scored higher 
on the Masculinity-Femininity scale (noted 
as possibly due to increased sensitivity and 
aesthetic appreciation), the Psychopathic 
Deviate scale and the Hypomania scale. 
Brill, Compton, and Gravson (1971) also 
found that college undergraduate marijuana 
users (some of whom used other drugs) 
scored significantly higher than nonusers on 
the Minnesota Multiphasic Personality In- 
ventory Psychopathic Deviant scale. On the 
other hand, Davis and Brehm (1971) in a 
study of young drug users in the North Caro- 
lina State Prison system, found no significant 
difference between users and nonusers. Users 
in this study consisted of users of both 
psychedelics and hard drugs. 

McAree, Steffenhagen, and Zheutlin (1969) 
hypothesize that the potentially more dis- 
turbed individual is attracted toward the 
more dangerous forms of drugs. Abnormal 
Minnesota Multiphasic Personality Invento- 
ries were found in the LSD users in the study 
by Smart and Fejer (1969), and diagnoses 
of personality disorder or borderline psycho- 
sis were suggested. Frosch, Robbins, and 
Stern (1965) support these findings with a 
sample of 12 LSD patients diagnosed as 
psychotic or borderline psychotic. Blumen- 
field and Glickman (1967) also surveyed a 
hospital population of youthful LSD users 
in which over 70% had had previous psychi- 
atric treatment and 80% were diagnosed as 
psychotic or borderline. They concluded, 
however, that the patients’ difficulties ante- 
dated the use of LSD (see also Flynn, 1970). 
Cohen and Klein (1970) in reviewing a pri- 
vate hospital population, found heavy drug 
users more likely to be character disorders 
than psychotic. They suggest that the diver- 
gence of their findings from other studies 
may be due to the fact that users admitted 
to public hospitals are more likely to be psy- 
chotic (or diagnosed psychotic). The clinic 
population studied by Hekimian and Gershon 
(1968) revealed that 50% of the psychedelic- 
using patients had been considered schizo- 
phrenic or treated for schizophrenia before 
drug ingestion. In marked contrast to the 
diagnosis of narcotic users, Hensala et al. 
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Sadava ( 1970) reviews nine themes which 
may be functions of Psychedelic drug use by 
college students: (a) alienation and search 
or meaning (Blum et al., Dearden & 
Jekel, 1971; Freedman, 


; Leary, Metzmer, & 

Alpert, 1964; Pearce, 1971; Welpton, 1968); 
(5) disillusionment rebellion ( Brooks, 
1971; Dearden & Jekel, 1971; Janowitz, 
i 70; Suchman, 1968); 

С) need for Stimulation (Flynn, 1970; Kenis- 
ton, 19 5; Liebert, 1967); 
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As with research on narcotic use, there 
are definite problems in obtaining data which 
are valid, Tepresentative, and reliable, Due 
to the illegality of Psychedelic drugs, volun- 
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from other variables, and a large number of 
isolated statements have resulted. For these 
statements to be of explanatory value, how- 
ever, their relationship as regards use and 
their place in the development of deviant 
drug use must be made compellingly clear by 
being embedded in an explanatory theoretical 
structure, 

Another related problem with the research 
is the general lack of comparability and co- 
ordination of the sociocultural and personal- 
ity lines of inquiry, and the consequent lack 
of control for personal factors in socio- 
cultural studies, and vice versa. If it is reason- 
able to assume that it is the interaction of 
the person and his sociocultural environment, 
and not either alone, that results in a spe- 
cific behavior—be it studying for an exam or 
smoking marijuana—then it is not enough to 
say, for example, that drug use is the result 
of low personal expectations for success, or 
that it is the result of models and sanctions 
for such use, Rather, it is the combination of 
these that should offer greater prediction and 
explanation, What is required for an adequate 
Psychosocial understanding of drug use is a 
theoretical approach to research embracing 
both sociocultural and personality variables. 

A final critical issue posed by the research, 
reflected in the format of this review, is the 
lack of coordination among alcohol, narcotic 
and psychedelic researchers. The same vari- 
ables may be involved in each kind of use, 
and therefore, there should be coordinated 
examination of all kinds of adolescent drug 
use along theoretically salient social-psycho- 
logical dimensions. For example, the adoles- 
cent narcotic user and alcohol user might be 
found to differ merely in degree of alienation 
(all other things being equal). 

In conclusion, the formulation of adequate 
research designs investigating the  socio- 
cultural and personality correlates of adoles- 
Cent drug use requires a comprehensive theo- 
retical system which takes into account socio- 
cultural and personality factors and relates 
them to the development of deviant drug use. 
With such a theoretical base and with com- 
mensurate methodological sampling and sta- 
tistical sophistication, empirical investigations 
could be made which do justice to the com- 


plexity of the phenomenon of 
adolescent drug use. 


deviant 
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PROCESSES OF FEAR REDUCTION IN 
SYSTEMATIC DESENSITIZATION: 
АМ ADDENDUM TO WILSON AND DAVISON (1971) 


LARRY D. REID? 


Bradley University 


Recent studies are discussed in the context of Wilson and Davison’s review of 
fear reduction techniques tested with animals. In each study, rats had positive 
intracranial stimulation paired with circumstances eliciting avoidance. Results 
demonstrate the possibility of counterconditioning and thereby provide an 
experimental basis for certain procedures of systematic desensitization. 


Wilson and Davison (1971), in citing re- 
sults of Moltz and Nelson, suggested that the 
primary, and perhaps only, function of a 
“counterconditioner,” such as food with ani- 
mals and relaxation with human beings, was 
to act as a facilitator of exposure. They said 
Nelson’s studies “indicate strongly that the 
provision of a stimulus that supposedly elicits 
an anxiety-competing response (food, chlor- 
promazine) facilitates extinction of fear 
through the increased exposure to the CS 
which it can bring about [p. 5].” Studies pro- 
viding strong evidence for the efficiency of 
Counterconditioning in reducing indexes of 
fear used the conditioned suppression and 
Passive-avoidance paradigms and not the 
active-avoidance paradigm which may be the 
better analogue for desensitization. Conse- 
quently, Wilson and Davison were hesitant 
to conclude that these findings were appli- 
cable. Other studies purporting to demon- 
Strate the efficacy of counterconditioning 
were criticized primarily on methodological 
grounds. 

Rats are subjects of choice for testing many 
Propositions; however, it is difficult to manip- 
ulate affective states in rats as relaxation or 
Pleasant imagery allows in desensitization 
therapy. The trained client can regulate, at 
request, sequencing of anxiety-eliciting pro- 
cedures and relaxation rather precisely. With 
animals, providing food has been the most 
frequently used counterconditioner. However, 
some levels of fear inhibit feeding (Estes & 
Skinner, 1941) making it difficult to balance 
———Ó—M—— 

3 reprints should be sent to Larry 
D. сш Psychology, Bradley Uni- 
Versity, Peoria, Illinois 61606. 


propensity to eat against fear so that food 
can countercondition fear. Also, it is not 
known when food has its effect. New tech- 
nology, however, now allows precise control 
of intensity and temporality of affective states 
in rats through the procedures of direct elec- 
trical stimulation of hypothalamus. Extensive 
research with intracranial stimulation (ICS) 
has made it possible to impose upon rats an 
unambiguously positively reinforcing stimu- 
lus (Olds, 1962; Reid, 1967). Furthermore, 
ICS can mask or attenuate pain and fear 
(Reid, 1967). Three recent studies have 
capitalized on the advantages provided by 
ICS and have tested the efficacy of counter- 
conditioning while controlling important vari- 
ables such as exposure time of the conditioned 
stimulus (CS) and temporality of counter- 
conditioner. 

Gordon and Baum (1971) trained rats to 
avoid footshock by jumping to a retractable 
platform. Prior to training for avoidance, rats 
had been fixed with electrodes to deliver posi- 
tive ICS. After avoidance was established, 
footshock was terminated and rats were pre- 
vented from jumping by removal of the plat- 
form (response prevention). One group also 
received trains of positive ICS throughout the 
prevention time. The group that received re- 
sponse prevention plus ICS was most effi- 
ciently deconditioned (median measures of 
response strength were less by a factor of 25). 

Monico (1971) fixed rats with electrodes 
to stimulate positive hypothalamic areas. 
They were then placed in a chamber deliver- 
ing footshock 5 seconds aíter light (CS) onset 
with CS and shock continuing until а lever 


press. If there were a press before 5 seconds 


107 


108 


group. The g 
CS with ICS 


een groups 


and after Cs and bet 


Á Paired 
ISlve event, efficiently att 

ж у €nuate 

persisting avoidance Conditioneq siima 


LARRY D. REID 


exposure time was the same regardless of 
whether or not rats received ICS; conse- 
quently, ICS did not merely facilitate ex- 
posure to the CS. We need not hedge; there 
is clear evidence from the animal laboratory 
that extended exposure to fear-eliciting condi- 
tions without aversive consequences typically 
leads to deconditioning of the conditioned 
emotional response, of passive avoidance, and 
of active avoidance, but that deconditioning 
in each paradigm is facilitated when CS is 
accompanied by a positive affective stimulus. 
In addition to being a facilitator of exposure, 
an anxiety-competing or positive stimulus can 
be а counterconditioner. 

The mechanism of counte 
unknown. A start tow. 
anism seems to be em 


rconditioning 15 
ard specifying the mech- 
bodied in those attempts 

to specify the interrelatedness of forebrain 
systems mediating Positive and negative affect 
(Stein, 1964; Young, 1967), The structure 
of the limbic system, allowing reverberation 
and positive feedback, is an adaptation for 
maintaining behavior in the absence of im- 
mediate stimulus control. A countercondi- 
tioner may be any event that would interrupt 
or diminish reverberation maintaining fear. 
Studies reviewed here lead to a different 
emphasis from that of Wilson and Davison 
(1971). But more importantly, they are à 
demonstration of & general technique that 
will allow for preclinical screening of potential 
therapies, Using avoidance paradigms with 
animals and ICS, it is possible to vary SYS” 
tematically parameters of strength of avoid- 
ance and parameters of “therapy” as spacing, 
intensity, quality, and frequency of affective 
change. Such parametric studies should yield 
conclusions about the most efficient way (0 


reduce fear and anxiety without harmful side 
effects. 
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since 48 of the 81 small squares are under 
the OD curve. 


Performing Standard. Significance Tests with 
an Ordinal Dominance Curve 


Most or all of the computation in at least 
five common nonparametric significance tests 
can be avoided once an OD curve is con- 
structed. The five are the 2 X 2 chi-square 
test, the median test, the Mann-Whitney U 
test, the Wald-Wolfowitz runs test, and the 
two-sample Kolmogorov-Smirnov test. 

Consider first the 2 X 2 chi-square test 
computed when a worker divides a scale at 
some point and tests the null hypothesis that 
equal proportions of the two groups lie above 
(or below) that point. If the groups are 
roughly equal in size, then the phi or cor- 
relation for any point can be read by finding 
the appropriate values of Р, and Р» in Fig- 
ure 3.* For example, in Figure 2 the point 
between scale values of 6 and 7 corresponds 
to a phi of about .28. If the conditions for a 
chi-square test are met, then the value of 
phi corresponding to a given significance level 
can be found either from a table for the sig- 
nificance of the Pearson r with N, + Ns cases, 
or from the relation ф = \/x;2/(N + N»). 
In the present example, either method shows 
the critical value at the 5% level to be .37. 
If sample sizes are too small for this method, 
Fisher's exact 2 X 2 test may be used instead. 

'The median test (Conover, 1971, pp. 167- 
172; Siegel, 1956, pp. 111-116) consists 
simply of dichotomizing at the median and 


*Figure 3 was drawn with the assumption that 
the two groups are exactly the same size, However, 
investigation has shown that it is quite accurate 
within the range of figures usually encountered. For 
example, if the larger group is not more than 5/4 
the size of the smaller, and if both within-group 
Proportions are between .1 and -9, and if phi as 
read from Figure 3 is 30, then it can be shown 
that the true phi is between 29 and .31. The error 
is thus so small that it can be largely ignored. In 
general, the margin of error is Ereatest around the 
edges of Figure 2, and smallest in the center. Since 
the major reason for computing phi is usually to 
test the significance of its difference from zero, and 
since the chi-square significance test is also usually 
inaccurate at the extreme edges of Figure 2, we can 
conclude that moderate ineq 


qualities of Sample sizes 
place few or no more limitations on the use of Figure 
3 than are placed by the limitations of the chi-square 
test. 


then performing a 2 X 2 chi-square test as 
above. The median is readily found in an OD 
curve, so this test too can be done with little 
or no computation. 

The Mann-Whitney test (Conover, 1971, 
рр. 224-236; Siegel, 1956, pp. 127-136) also 
has a remarkably simple relation to an OD 
curve. The number of small squares below the 
OD curve equals the number of units by 
which the rank sum for Group 1 exceeds its 
minimum possible value of IV4 (IN; + 1)/2, 
and the number above equals the number of 
units by which the rank sum for Group 
2 exceeds its minimum possible value of 
N»(N»--1)/2. In Figure 1, for example 
where N, = N» = 9, the minimum possible 
value of either rank sum is 45. Since there are 
33 small squares above the OD curve, the 
rank sum for Group 2 equals 45 + 33, or 
78. From any standard table, this corresponds 
to a one-tailed probability level of .273. Al- 
ternatively, U in the Mann-Whitney test is 
the smaller of the number of squares above 
or below the OD curve—33 in Figure 1. 

A Wald-Wolfowitz runs test (Conover, 
1971, pp. 350-356; Siegel, 1956, pp. 136- 
145) can be performed, once an OD curve 
has been constructed, simply by counting the 
segments of the curve. In Figure 1, for ex- 
ample, there are nine vertical or horizontal 
segments of the OD curve, hence nine runs. 
With №, = N; = 9, six or fewer runs are re- 
quired for significance. 

The test statistic in the Kolmogorov- 
Smirnov two-sided two-sample test (Conover, 
1971, pp. 309-314; Siegel, 1956, pp. 127- 
136) is the largest difference between the two 
cumulative frequency distributions, which 
equals the largest vertical distance between 
the OD curve and the diagonal line. In 
Figure 1, for example, the largest difference 
is 4/9 or .44; a difference of .67 would be 
required for significance. 


MIXED AND COMPLETE DOMINANCE: NEW 
VERSIONS or Hyporursrs ABOUT Dis- 
PERSION AND ELEVATION 


No standard statistical test examines hy- 
potheses of dispersion or elevation in the form 
in which they are frequently most interesting 
to the behavioral scientist. 


P.Proportion of the upper subgroup 


40 


50 4 70 


PrProportion of the lower subgroup 


Fic. 3. A chart for making graphic estimates of phi (¢) when two groups 
are the same size, (Reprinted with permission of the McGraw-Hill Book Co. 
From J. P. Guilford, Psychometric methods. New York: McGraw-Hill, 1954.) 


Mixed Dominance and Dispersion 


A typical example of a hypothesis about 
ispersion is given by Siegel (1956): “Women 
Who have personal difficulty in handling ag- 
StéSsive impulses are more extreme than 
Others in their judgments of hostility in film 
Characters- some underattribute and others 
Overattribute |p. 149]." The data used to 
test this hypothesis were the hostility-attribu- 
lion scores of a group of women with diífi- 
Culties controlling their aggressive impulses, 
and another group of women without such 
difficulties, The data are given in Figure 1 
Of the present paper. The hypothesis predicts 
that scores in Group 1 should be more vari- 
able than those in Group 2. This prediction 
Was confirmed by the Siegel-Tukey (1960) 
test and the Freund-Ansari (1957) test, so 
the hypothesis quoted above would appear to 
* confirmed, Thus a double conclusion is 
accepted: Some members of Group 1 over- 
attribute and some underattribute. Inspection 


of Figure 1, however, lends little support to 
the conclusion that some underattribute, The 
two distributions are nearly identical at the 
low end of the scale, since the OD curve 
roughly follows the diagonal. As we shall show 
more fully later, the data are consistent with 
the hypothesis that the two group distribu- 
tions are identical below their medians but 
that Distribution 1 has a longer tail at the 
high end. Thus the conclusion that the two 
distributions differ in variability is Supported, 
but the original hypothesis quoted from 
Siegel is not. This is unfortunate, Since the 
quoted hypothesis is clearly more interesting 
than the hypothesis actually confirmed by 
the Siegel-Tukey and Freund-Ansari tests, 
The quoted hypothesis states that both 
tails of Distribution l contain more Cases 
than the tails of Distribution 2; that is, 
P, — P» is positive at the low end of the scale 
and negative at the high end. Thus the OD 
curve must cross the diagonal, being below 
it at the low end and above it at the high 
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COMPARING TWO GROUPS BY SIMPLE GRAPHS: 


RICHARD B. DARLINGTON ? 
Cornell University 


igni ests for testing whether two independent groups are equal 
een ере ues do not test the forms of those hypotheses which are 
frequently of most interest to behavioral scientists. More interesting forms of 
those hypotheses can be tested easily and quickly by drawing a simple qe 
The same graph can also be used to períorm several standard tests. In all these 
tests, the procedure is so simple that the chance of computational error is much 


smaller than with standard procedures. 


When a significance test is used to reject 
the null hypothesis that two independent sets 
of scores are identically distributed, the al- 
ternative hypothesis is almost always one of 
three types. In order of their frequency of use 
they are: hypotheses specifying differences in 
elevation or location (usually tested by the ¢ 
test, Mann-Whitney U test, or the median 
test or other 2 X 2 tests); hypotheses specify- 
ing differences in variation or dispersion (usu- 
ally tested by the F test on variances or by 
the nonparametric Siegel-Tukey or Freund- 
Ansari tests); and hypotheses which do not 
specify the nature of the differences (usually 
tested by the Wald-Wolfowitz runs test or 
the Kolmogorov-Smirnov test). A simple 
graph makes it possible to perform most of 
these tests in ways which are quick, easy to 
understand, and almost completely free of 
the chance of computational error. Some new 
tests, which test hypotheses about elevation 
and dispersion which are frequently more in- 
teresting to behavioral scientists than any of 
the specific hypotheses tested by standard 
tests, are also performed easily. The graph 
will be called an ordinal dominance or OD 
Curve, though a somewhat similar curve is 


called a P-P curve by Wilk and Gnanadesikan 
(1968). 
n —— 

! For many helpful criticisms and Suggestions the 
author is indebted to his Student. 
Jean Rom, Sharon Weinberg, Robert Wunderle and 
Others, and to his colleagues Jason Mil 


e Millman, T. A. 
Ryan, and Gordon Wilcox. Any errors remain his 
own. 


? Requests for reprints should be sent to Richard 
- Darlington, Department of Psychology, Uris 
Hall, Cornell University, Ithaca, New York 14850, 
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ORDINAL DOMINANCE CURVES AND THEIR 
RELATION TO SOME FAMILIAR 
STATISTICAL PROCEDURES 


Construction of an OD curve is very 
simple, as can be illustrated by Figure 1. 
First, a rectangle is drawn with length and 
height, respectively, of IY, and М», the sizes 
of the two groups. Then the scores of Group 
1 are ordered and written immediately below 
the rectangle. The Scores of Group 2 are 
ordered and written immediately to the left 
of the rectangle.? 

Each segment of an OD curve represents 
the score broadside to it. Starting at the 
lower left corner, the OD curve is drawn by 
drawing a continuous curve upward and to 
the right so that each successive curve seg- 
ment represents the smallest score not repre- 
sented by a previous Segment. In Figure 1, 
for example, the smallest score is 0 from 
Group 1, so the first segment of the OD curve 
is horizontal. The next smallest score is 3 
from Group 2, so the next segment is vertical; 
and so on. Thus the N, + V 2 Scores are repre- 
sented in the order of their size, from smallest 
to largest, as shown in Figure 1. 

It is often convenient to w 


‘rite the within- 
group rank orders adjacent to the scores, and 


the values of P, and Ps, the Corresponding 
Proportions, on the right and upper sides of 


the figure. Each value of P is written adjacent 
to a line, not a s 


quare, of the grid. Thus a Р 
value of, Say, .5 corresponds to the division 
between the two median scores of a group of 


even size, rather than to one score or the 
Other. A strai 


ght line from the Tectangle’s 
lower left to upper right corner may also be 
drawn showing the points where Py = Py. 


? These data are taken from Siegel (1956, p, 149), 


x. 
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When scores from different groups are tied, 
the OD curve is drawn diagonally across the 
spaces representing the tied scores. This is 
illustrated in Figure 2, which shows the rat- 
ings on a 9-point scale given by subjects in 
an experiment by Jones and Panitch (1971). 
The topmost segment of the OD curve, for 
example, represents all the scores of 9—four 
scores from Group 1 and one score from 
Group 2. It is drawn diagonally across the 
space represented by those scores. 

An OD curve is a type of cumulative fre- 
quency curve; it shows the cumulative fre- 
quency of Group 2 as a function of the cumu- 
lative frequency of Group 1. 

The shape of the OD curve is determined 
entirely by the rank order of the observa- 
tions, and that rank order can be recon- 
structed completely from the shape. The 
curve is thus not a summary of the data; 
rather, it is a complete representation of the 
rank-order properties of the data. 


Basic Descriptive Statistics Easily Read from 
an Ordinal Dominance Curve 


Four common types of descriptive statistics 
are easily read from an OD curve. None of 
these is easily read from standard frequency 
curves, 

The slope of the straight line between any 
two points on an OD curve equals the relative 
frequency of Groups 1 and 2 within the inter- 
Val bounded by those points. For example, 
Consider the point on the OD curve in Figure 
2 between segments representing scale values 
of 5 and 6. A straight line from the lower 
left corner to this point has a slope of 6/3, 
Or 2, so twice as many subjects from Group 2 
as from Group 1 are in the interval 1-5. 

The vertical distance by which the OD 
Curve lies above the straight diagonal line at 
any point equals P» — P; at that point. 

If any point in an OD curve is connected 
With the diagonal by a straight line with E 
slope of — 1, the value of P; or P» where this 
line meets the diagonal equals the proportion 
9f the combined group falling below the point. 
For example, a line through the midpoint of 
the diagonal in Figure 2, with slope — 1, 
intersects the OD curve between segments 
representing scores of 6 and 7. Thus 50% of 
the combined group falls below this point. 
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Fic. 1. An OD curve for data with no ties 
between groups. 


Each small square represents a comparison 
between the Group 1 member below the 
square and the Group 2 member to its left. 
Since the squares below the OD curve repre- 
sent comparisons in which the Group 1 mem- 
ber is higher than the Group 2 member, the 
proportion of the rectangle's area under an 
OD curve equals the probability that a ran- 
domly chosen member of Group 1 will score 
higher on the scale than a randomly chosen 
member of Group 2 (with any ties split 
evenly between the groups). For example, in 
Figure 1 this probability is 48/81 or .59, 
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Fic. 2. Ап OD curve for data with many ties, 
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end. We shall define the case in which the 
OD curve touches or crosses the diagonal as 
the case of mixed dominance. The opposite 
case, in which the OD curve stays entirely 
on one side of the diagonal, will be termed 
complete dominance. The Siegel hypothesis 
is, then, the hypothesis that mixed dominance 
occurs in the population studied. 

This is not an isolated example. In many 
applications a difference in dispersion is of 
little interest unless it implies mixed domi- 
nance. Even if a difference in variances is in- 
teresting, a further demonstration of mixed 
dominance usually makes it still more in- 
teresting. 


Complete Dominance and Elevation 


Hypotheses about differences in elevation 
are usually tested by the ¢ test, the Mann- 
Whitney rank-sum test, or the median test. 
One group may be "higher" than the other 
in the sense tested by any of these tests (or 
all three together) even though mixed domi- 
nance exists. Yet, if mixed dominance exists, 
then a general, unqualified statement like 
“Group 2 is higher than Group 1” seems 
questionable, since the proportion of Group 1 
above some scale points then exceeds that of 
Group 2. For some purposes, then, Group 1 
may be “better” or "higher" than Group 2, 
even though Group 2 is higher in the sense 
tested by any standard test, 

Even if a measure is clearly interval or 
ratio, like number of right answers on a 
multiple-choice test, the group with the lower 
true mean may have a higher proportion of 
members above some important point on the 
scale, due to differences in shape or variance 
of within-group distributions, This result can- 
not occur under an assumption sometimes 
mentioned by Statisticians—that two distri- 
butions are identical in variance and shape 
(Conover, 1971, p. 225). However, the as- 
sumption is so narrow that it is almost always 
ignored in practice (as in Conover, 1971, p. 
226), and is not even defined f 
scales, 

In many applications, then, the 
supported by standard tests of elevation are of 
much less interest than the hypothesis of com. 
plete dominance, If Group 2 completely domi. 
nates Group 1, then the proportion of Group 


or ordinal 


hypotheses 


2 exceeds that of Group 1 above any point 
on the scale. Group 2 is then also higher than 
Group 1 in the sense tested by the Mann- 
Whitney test; that is, a randomly chosen 
member of Group 2 has a better-than-even 
chance of being higher than a randomly 
chosen member of Group 1. If the scale is 
interval, the mean of Group 2 is not only 
higher than that of Group 1 but would re- 
main higher aíter any monotonic transforma- 
tion of the scale. This is of particular interest 
to a worker who thinks of his observed scale 
as being monotonically, but not necessarily 
linearly, related to some other variable of 
more fundamental interest. For example, a 
worker may think of an exam score as having 
a monotonic but nonlinear relation to "'abil- 
ity" or "knowledge," or a decision theorist 
may think of an observed scale as having 
a nonlinear relation to “utility.” Complete 
dominance of Group 2 would then imply that 
Group 2 is higher than Group 1 in mean 
ability or knowledge or utility, even though 
the precise relations between these variables 
and the observed measures are not specified. 
Thus, if complete dominance is demonstrated, 
there need be no hesitation in stating a gen- 


eral unqualified conclusion like “Group 2 is 
higher than Group 1.” 


Procedures for Demonstrating Mixed and 
Complete Dominance 


A worker should first attempt to reject the 
identity hypothesis (the hypothesis that the 
two group distributions are identical in every 
Way), using one of the tests mentioned 
above. If the identity hypothesis is rejected, 
and the sample OD curve crosses the diagonal 
line, the next step is to attempt to reject the 
complete dominance hypothesis, thereby ac- 
cepting the mixed dominance hypothesis as 
the only remaining alternative, To do this, 
the worker should divide his sample into two 
subsamples, one consisting of those subjects 
above the point where the OD curve crosses 
the diagonal line, and the other consisting of 
those below that point The worker then 
CHIRAC 


^ If the OD curve crosses the line at several points, 
any one of these may be chosen, We suggest, for 
reasons concerning the power of the tests to follow, 


choosing the point closest to the median of the 
total sample. 
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Should perform a test for differences in eleva- 
tion in each subsample, attempting to show 
that the OD curve goes significantly above 
the diagonal line in one subsample and signifi- 
cantly below it in the other." The mixed domi- 
nance hypothesis then is accepted at the 
higher (ie, less Significant) of these two 
probability levels, In Figure 1, for example, 
the upper and lower subsamples contain, re- 
spectively, five and four subjects from each 
group. In the upper subsample, the Mann- 
Whitney test rejects the identity hypothesis 
at the .016 level (one-tailed). In the lower 
subsample, however, the comparable proba- 
bility level is merely .243. The hypothesis of 
mixed dominance thus is not accepted in this 
example. 

If the identity hypothesis is rejected but 
mixed dominance is not shown, a worker may 
wish to support a conclusion of complete 
dominance, The simplest way to reach such 
à conclusion is to simply assume írom the 
nature of the problem that mixed dominance 
does not exist. Since the identity hypothesis 
has been rejected, the complete dominance 
hypothesis then is accepted as the only re- 
maining alternative. Though it may appear 
trivial at this point in the exposition, this 
Conclusion answers a question of considerable 
interest to behavioral scientists. A worker 
Who has found a significant difference in the 
elevation between two groups by a / test or 
Other standard test is often dissatisfied with 
the narrowness of the conclusion that he is 
technically allowed to draw from this result. 
He often asks whether there is some assump- 
tion which, if made, will make his result of 
More general interest. We have already seen 
that an assumption often mentioned by statis- 
ticians, that two group distributions are iden- 

" The fundamental fact which makes this pro- 
cedure acceptable is the fact that if the null hypothe- 
sis is correct, each of the Mann-Whitney statistics 
Will be distributed as shown in the Mann-Whitney 
tables in the subset of samples in which the OD 
curve touches the 45-degree line at the point chosen. 
In other words, this significance test is based not 
9n all possible samples, as most significance tests 
аге, but on all samples with a particular character- 
istic. This technique is the basis of the Fisher exact 
2 X2 test and other randomization tests. In the 
Present case, the two Mann-Whitney tests are 
Statistically independent of each other in the subset 
Of samples described. 


tical except for their means, is inapplicable 
to ordinal scales and too restrictive when 
applied to interval scales. We now see that 
a much less restrictive assumption, the as- 
sumption of no mixed dominance, enables 
a worker to draw a very general conclusion, 
the conclusion of complete dominance, from 
а f test or other standard test of elevation. 

A worker who is unwilling to assume ab- 
sence of mixed dominance may still reach 
à conclusion of considerable interest by a 
further analysis of his data. We saw earlier 
how he can perform a 2 x2 chi-square-type 
test, by comparing a point on an OD curve 
to Figure 3, after determining the value of 
phi necessary for significance. In a few sec- 
onds, a worker can perform this test for every 
point on the OD curve, marking the points 
which turn out to be significant. The points 
very near the two ends of the OD curve can 
never be significant by this test, since the OD 
Curve necessarily approaches the diagonal at 
its two ends. However, there may be a sub- 
stantial range in the middle of the curve in 
which every point is significant. If 50, We can 
say that a complete dominance relation exists 
within this range; This implies that the 
Proportion of, say, Group 2 exceeds the pro- 
portion of Group 1 above every point within 
the range, and that the mean of Group 2 ex- 
ceeds that of Group 1 for any monotonic scale 
transformation which is flat outside the range. 

This procedure has the strong advantage 
Over ¢ tests or other standard techniques that 
it clearly specifies the range over which a con- 
clusion about a difference in elevation is 
known to be valid. The common-sense ob- 
servation that conclusions about geniuses or 
star athletes should not be drawn from ran- 
dom samples of schoolboys is thus made more 
specific. A worker who uses a standard test 
on elevation to draw an unrestricted conclu- 
sion like “farm boys in general are more 
physically fit than town boys” (as in Conover, 
1971, p. 226) will Perhaps tempt some readers 
КЕТТ the possibility that а range of 
complete dominance may be found just by chance, 
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to extrapolate the conclusion beyond the scale 
levels actually warranted by the data. Even 
a careful and well-intentioned reader will be 
unclear about just how much extrapolation is 
warranted. Using the present technique, how- 
ever, the investigator will be able to make a 
precise statement about the range over which 
his conclusion is known to apply. 

In summary, the present technique has the 
following advantages over conventional tech- 
niques: (a) It specifies precisely the range 
over which its conclusions are know to apply; 
(5) its conclusions within that range are very 
strong and interesting; (c) these conclusions 
are reached without any assumptions beyond 
ordinal scaling of variables; (d) the pro- 
cedures used are quick, easy to understand, 


and free of serious chance of computational 
error, 
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THE VIEW FROM THE PASSIVE VOICE 
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An integration of concepts from traditional and 


generative linguistics and a 


reinterpretation of experimental research in psychology have led to the following 


conclusions: (a) Chomskyan deep structure 
between the active voice and the passive 


aptures the semantic commonality 
voice. (b) The semantic differences 


between the voices are attributable primarily to Properties of the Passive, the 
active serving as a neutral base by reference to which the passive takes on its 


significance. This asymmetrical re 
the marked status of the passive 


tion between the voices is a reflection of 
a-vis the unmarked active. 


(c) The func- 


tions distinguishing the passive from the active are (i) to allow for emphasis 


of the logic: [d 
speaker to build a presuppositional ini 


rast 


al object and to provide discourse continuity and (4) to enable the 


ructure into his message. For instance, 


the sentence The boy was bitten by the dog presupposes that the boy was 


bitten by someone and 


asserts that that someone was the dog. 


Actives are 


considered to be generally presuppositionally neutral. 


Language affords a rich variety of oppor- 
tunities for changing the syntactic structure 
of a message while retaining its essential con- 
tent and lexical composition. For instance, 
Sentences 1-5 below exemplify only a sample 
of the structural paraphrases of the content 
involved. 


1. The police believe that John has stolen 
the jewel. 

2. The police believe that the jewel has been 
stolen by John. | 

3. John is believed by the police to have 
Stolen the jewel. . 

^. "The jewel is believed by the police to have 
been stolen by John. 

5. What the police believe John to have 
stolen is the jewel. 


What are the functions of structural para- 
Dhrase? This paper attempts to show that 
Some recent work in generative linguistics in- 
Volving, in particular, the notions presupposi- 
tion and markedness point to new avenues for 
answering this question. The discussion will 
be developed around the active-voice-passive- 
voice relation, which has generally been con- 
Sidered as the paradigmatic case of structural 
Paraphrase. | R 

The paper is divided into four sections. 

1 Requests for reprints should be sent to Moshe 
Anisfeld, Department of Psychology, Yeshiva Uni- 
Versity, New York, New York 10003. 
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The first section deals with the formulation 
of the essential commonalities between the 
two voices. The second section deals with 
differences between them in terms of the 
notion of markedness, The third section dis- 
cusses the semantic differences between the 
active and the passive as they have been 
traditionally conceptualized by linguists and 
studied experimentally by psychologists. The 
studies reviewed in the third section are re- 
interpreted from the perspective of marked- 
ness developed in the preceding section. The 
fourth section elaborates the presuppositional 
differences between the two voices and ex- 
amines the interaction of presupposition and 
markedness, 


Active-Passive C. ommonalities 


The active voice and the passive voice are 
generally recognized as basically synonymous, 
Thus, for instance, Sentences 6 and 7 express 
the same content, 


6. The dog bit the boy. 
7. The boy was bitten by the dog. 


These two sentences are identical with respect 
to truth value: if sentence 6 is true, Sentence 
7 is also true, and vice versa. However, the 
close affinity between the two voices had not 
received formal expression till the advent of 
generative transformational srammar (Chom- 
sky, 1957). The introduction of a deep leve] 
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of syntactic analysis in addition to the sur- 
{асе level of previous theory provided trans- 
formational grammar the vehicle for formu- 
lating the essential similarity between the 
two voices. On the suríace level, the simple 
active (as in Sentence 6 above) has the 
structure shown in 8 below and the passive 
has the structure shown in 9. 


8. NP, — Aux — V — NP» 
9. NPs — Aux + be + en — V 


by + NP, 


Where: NP = noun phrase (e.g., the dog, the 
boy), V = verb (eg. bite, break), Aux = 
forms that specify number and tense of verb, 
en = participial form of verbs (e.g., bitten, 
broken). Comparison of Structures 8 and 9 
shows that what is the grammatical subject 
(NP; ) in the active is the grammatical object 
in the passive and what is the grammatical 
subject in the passive (NP.) is the grammati- 
cal object in the active. On the deep-structure 
level, however, the two constructions have 
the same representation. The deep structure 
expresses the logicosemantic relations, which 
are common to both voices. In deep structure, 
NP, is the logical subject of both Sentences 
бапа 7 and NP» is the logical object of both 
sentences. Thus, in the active there is a co- 
incidence between logicosemantic relations 
and grammatical relations, whereas in the 
passive, the logical subject is the grammatical 
object and the logical object is the grammati- 
cal subject. The mechanism effecting this in- 
verted mapping of the deep structure onto the 
surface structure is the passive transforma- 
tion. This transformation, along with such 
other transformations as ne 
tion, was initially considered to be optional 
(Chomsky, 1957, pp. 42-43, 77-83, 112), 
but later, in order to prevent transformations 
from changing meaning, it, along with the 
other ones, was formulated as obligatory and 
dependent on the presenc n 


А е of an optional 
marker in the deep structure 


1965, pp. 103-106, 223; Katz & Postal, 1964 
Dp. 72-74). This change is of little psycho- 
logical consequence except as it reflects the 
restriction of the semantic analysis of genera- 
tive grammar to deep structure relations and 
the consequent inability of the theory to ac- 
commodate potential differences between the 
active and the passive. However, the main 


gation and ques- 


(Chomsky, 
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thrust of generative grammar in its first 
decade was to account for linguistic similari- 
ties unrecognized or unexpressed in descrip- 
tive theories of language, and this purpose, 
as we have seen, it achieved with respect to 
the active and passive constructions, Genera- 
tive theory also provided a formal justifica- 
tion for the general practice of grammarians 
to consider the active as more basic than the 
passive. The greater complexity of the passive 
is due to the transformation that interchanges 
the positions of the two noun phrases. 


Active-Passive Formal Differences: On 
Markedness 


The greater transformational complexity of 
the passive in relation to the active can be 
profitably conceptualized within the theory 
of markedness. The notion of markedness 15 
designed to distinguish between the typical 
or normal cases (unmarked) and the ones 
that in comparison are less typical (marked). 
The concept was developed in the 19305 
within the Prague school of linguistics and 
has been revived in Greenberg’s (1966) work 
on language universals as well as in genera- 
tive phonology (Chomsky & Halle, 1968, Ch. 
9; Postal, 1968, Ch. 8). In this section, We 
will first discuss markedness in the realms 
of phonology, morphology, and semantics 
where it has been mostly studied, and then; 
extend it to our present syntactic case of 
active-voice-passive-voice relations. 

The notion of markedness developed from 
an observation of asymmetries in the behav!or 
of closely related items. One of the chief 
manifestations of the asymmetry is the g€- 
neric behavior of one of the items (un- 
marked) in comparison with the specific be- 
havior of its mate (marked). Thus, in an- 
tonymous adjectival pairs such as long-short, 
the first term stands not only for one pole of 
the dimension but also for the entire category. 
This asymmetry is reflected in the status os 
questions. The question How long 15 РЕ 
paper? does not imply that it is either long 
or short. Whereas the question How short X 
the paper? does imply that it is short. Also, 
length, derived from long, refers to the entire 
dimension. In phonology, the generic chat- 
acter of unmarked forms is reflected in the 
Phenomenon of neutralization. In certain 
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phonological environments only one member 
of a phonemic pair occurs, representing the 
entire superordinate archiphoneme. A second 
characteristic of unmarked items is their rela- 
tive structural simplicity. Comparing the sin- 
gular (unmarked) with the plural (marked) 
noun, Greenberg (1966) has noted that “The 
singular frequently has no overt mark while 
the plural is marked by an affix as in En- 
glish .. . [p. 73]." A third point noted about 
unmarked forms is that they tend to be dis- 
tributionally freer than their marked mates. 
For instance, in English /s/ can precede 
vowels, consonants, liquids (e.g. sip, spit, 
Ski, sleep) and one semivowel (e.g., sweep), 
Whereas /9/ is restricted to vowels and the 
two semi-vowels (e.g., thin, throw, thwart). 
In phonological analysis, these two sounds 
/s/ and /0/ constitute a minimal pair differ- 
ing only in one feature (stridency). The 
sound /s/ would be considered unmarked and 
/0/ marked. This decision is supported by 
the next two criteria for markedness. The 
fourth point concerns greater frequency, 
within a given language, of the unmarked 
item, Thus /s/ occurs 12 times as frequently 
in English as /0/ ( Miller, 1951, p. 86). A 
fifth characteristic of unmarked items is their 
vider presence in the languages of the world. 
In this connection, Chomsky and Halle 
(1968) have remarked that the least marked 
Consonants in English, including /pt k s n/, 
"are rarely absent in the phonological system 
ofa language [p. 413]." Finally, unmarked 
forms tend to interact more closely with other 
grammatical categories than marked forms. 
For instance, in German and other gender 
languages, the adjective agrees in gender with 
the singular (unmarked) noun but only one 
adjectival form exists for the plural (marked) 
noun, 

The coordination of these different ear- 
marks of markedness with one another and 
across the different levels of language (phono- 
logical, morphological, syntactic, and seman- 
lic) has not been settled yet on the rigorous 
theoretical level, but their interrelatedness 
is quite apparent on the intuitive level (see 
Greenberg, 1966). All the criteria converge 
On the generalization that the unmarked term 
is more natural and basic than the marked 
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one. Generative linguists have assigned central 
importance to the criterion of structural sim- 
plicity, in extended form, in the determina- 
tion of markedness. The extension has been 
beyond local simplicity of one item in relation 
to another and involves consideration of the 
effects on the entire System of generalizations 
of choosing a particular item as the unmarked 
one. The notion of simplicity has been treated 
in formal terms, but it is quite likely that 
structural simplicity will be correlated with 
psychological simplicity in internal processing 
and in production and perception. Discussing 
markedness in Phonology, Postal (1968) has 
stated that “Ultimately, perhaps some of the 
strongest evidence for assignment of Marked 
or Unmarked status will come from physio- 
logical and perceptual investigations [pp. 
170-171]." Following Roman Jakobson, he 
has also suggested ( Postal, 1968, p. 172) that 
unmarked items should appear before their 
marked counterparts in the course of linguis- 
tic development in children, 

Psychological research, concentrating 
mainly in the lexical domain, has explored 
some of the behavioral correlates of marked- 
ness. The research of Clark and his associates 
(Clark, 1969; Clark & Card, 1969, and refer- 
ences therein) has suggested that marked ad- 
jectives are coded in memory in terms of 
their unmarked counterparts, For instance, 
marked adjectives tended to be recalled much 
more often as their unmarked counterparts 
than vice versa, Hamilton and Deese (1971) 
found that unmarked adjectives, even when 
their referential meaning was neutral, were 
classified as more positive than their marked 
counterparts, Since, however, they have shown 
that unmarked adjectives are used more fre- 
quently, it is not clear whether the differences 
in evaluation are due directly to markedness 
or are mediated by frequency, Experiential 
frequency has been found to be correlated 
with evaluative judgments (see Anisfeld, 
1968; Johnson, Thomson, & Frincke, 1960), 

Returning now to the passive from the 
perspective of the foregoing discussion, jt 
seems clear that the English passive (as well 
as the passive in other languages, see Green- 
berg, 1966, p. 86) should be considered 
marked and the active unmarked. Tn the 
first place, the passive is longer and transfor- 
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mationally more complex than the active. The 
passive entails all the transformations the 
active does plus the one that interchanges 
the positions of the two noun phrases. Chom- 
sky (1957, pp. 79-81) has shown that re- 
versing this state of affairs by making the 
passive closer to the underlying structure and 
deriving the active by a special transforma- 
tion would render the grammar more compli- 
cated and cumbersome. The passive is also 
much less frequently used than the active, 
possibly because of its greater complexity. In 
an extensive survey of present-day English 
sentences totalling 320,000 words, Svartvik 
(1966, p. 46) found that the frequency of 
passives was one-seventh that of actives. Also, 
the formation of passives appears to be re- 
stricted. As Allen (1959) has noted, “most 
sentences that are good in the active voice 
are just grotesque curiosities when put into 
the passive voice [p. 290]." More strictly, 
there are some verbs that just do not take pas- 
sive forms. Excluded are not only intransitive 
verbs (e.g., sleep), which do not have direct 
objects, but also some other verbs (e.g., cost, 
resemble), which have direct objects (see 
Long, 1961, p. 119). Thus, Sentences 11 and 
13 cannot be formed from Sentences 10 and 
12, respectively. 


10. The book costs ten dollars. 

11. *Ten dollars are cost by the book. 

12. John resembles President Nixon. 

13. *President Nixon is resembled by John. 


Lees (1960, p. 8) has observed that verbs 
that do not take the passive do not take 
manner adverbials freely. On the basis of this 
observation, Chomsky (1965, pp. 101-106) 
has suggested that the passive marker in the 
deep structure be subordinated to Manner. 
However, this proposal must remain tentative 
until the critical adjective freely is more spe- 
cifically defined. Whether or not the restric- 
tion on the formation of the passive voice is 
related to other restrictions, the fact remains 
that the construction of the passive is con- 
strained. On the other hand, the main func- 
tion of the passive, that of reversing the noun 
phrases, can be accomplished, with some 
Verbs, in the active (Long, 1961, pp. 114- 
115). Evidence Pairs 14—15 and 16-17. 


14. The chairman 
promptly. 

15. The meeting began promptly. 

16. They ring the bells at six. 

17. The bells ring at six. 


began the meeting 


Finally, psychological studies have shown 
that the passive is more difficult to under- 
stand and to remember (DeVito, 1969; 
Huttenlocher, Eisenberg, & Strauss, 1968; 
Slobin, 1966; and the experiments reviewed 
in the next section). There is also some evi- 
dence that changes in the memory of sen- 
tences are more often from passive to active 
than vice versa (Mehler, 1963; Prentice, 
1966; see also Sachs, 1967, and Slobin, 1968, 
for some qualifications on this generaliza- 
tion). Developmentally, too, the active pre- 
cedes the passive (Gaer, 1969; Hayhurst, 
1967; Turner & Rommetveit, 1967a, 1967b). 
On the basis of the linguistic considerations 
and the psycholinguistic evidence we can 
clearly assign marked status to the passive 
and unmarked status to the active. This con- 
clusion is in accord with the general practice 
of grammarians to consider the active as 
basic. Long (1961, Ch. 5) has codified the 
unmarked status of the active by referring 
to it as the common voice. : 

Having established that the passive voice 
is marked and the active unmarked, it seems 
appropriate to ask what semantic functions; 
if any, are carried by this difference. This 
question will be considered in the next two 
sections in the context of a general discussion 
of the semantic difference between the tw? 
voices. 


Active-Passive Semantic Differences: Оп 
Contextual Contiguity and Emphasis 

Do the active and passive completely ove 
lap in meaning or are there any differences‘ 
Ziff (1966) asserted on the basis of examples 
that the two voices differ in meaning but 
offered no explicit definition of the difference- 
Chomsky (1965, Footnote 3 on p. 223; 8—6 
also, Katz & Martin, 1967) also sensed that 
the passive is more than a stylistic variant 
of the active, but saw no way to identify ie 
difference within transformational theory aS 
then formulated. Traditional attempts tO 
Specify the functions of the passive volce 
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have pointed to one or more of the following 
considerations (Jespersen, 1964, pp. 120- 
121): The passive voice is used when the 
logical subject is unknown, cannot be easily 
stated, or is self-evident (e.g., He was killed 
in the war. I was tempted to start dancing.). 
The agentless passive is very common. In 
fact, Jespersen estimated that over 70% of 
the passives in the works of English writers 
belonged to this category. Svartvik (1966, 
D. 141) has reached a similar conclusion. 
When the agent is provided, the passive is 
used either because the logical object attracts 
greater interest than the logical subject and 
hence is given greater emphasis and promi- 
nence by making it grammatical subject, or 
for stylistic purposes to "facilitate the connex- 
lon of one sentence with another: He rose to 
Speak, and was listened to with enthusiasm 
by the great crowd present [Tespersen, 
1964, p. 121].” The emphatic and stylistic 
functions of the agentful passive often co- 
incide as they both tend to reflect the the- 
matic interests of the speaker. The theme of 
the discourse receives expression by being 
placed in grammatical subject position, thus, 
giving it emphasis and providing discourse 
unity, 

Linguistic treatments of voice normally 
Concentrate on the functions of the passive 
Voice, reflecting the position that no reason 
needs to be given for the use of the active 
Voice beyond the need to convey informa- 
tion. We will now turn to a review of the 
Psychological literature on semantic differ- 
ences between the active and passive voices. 
The Psychological studies have generally dis- 
Cussed the differences between the two voices 
Without much attention to the asymmetric 
Character of the relation between them. But 
& reexamination of the results reveals, as will 
be seen below, differences to be expected 
from the marked-unmarked relation between 
the passive voice and the active voice. 

Psychological investigators, without much 
reference to the linguistic thinking in this 
domain, have also concentrated on the same 
two functions in their exploration of the 
differences between the voices: the contextual 
Contiguity function and the emphatic func- 
tion, Tannenbaum and Williams (1968a) 
found that the response latency in forming a 
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sentence to describe a simple line drawing 
(¢.g., a train hitting a car) was longer when 
the subjects had to describe the drawing in 
the passive voice than when they had to 
describe it in the active voice. The differences 
between the two voices widened when the 
description task was preceded by a preamble 
about the logical subject of the picture (in 
above example: a discourse about trains). 
On the other hand, the difference shrank 
when the preamble was about the logical ob- 
ject (about cars). These findings suggest that 
the response readiness of one voice or another 
can be primed through the manipulation of 
the topic of discourse, In the absence of 
such intervention, however, the active voice 
seems to be ina steady ready-state, as would 
be expected from its unmarked status, 

In another study, Tannenbaum and Wil- 
liams (1968b) found that the logical subject 
was a better cue for the recall of the verb 
of an active sentence previously presented, 
whereas the logical object was a better cue 
for the recall of the verb of a passive sen- 
tence. The difference between the two types 
of cues was, however, more than twice as 
large for the passive as for the active, In 
view of the marked status of the passive sub- 
ject and object, it seems reasonable that the 
difference between them should be weightier 
than the difference between their unmarked 
counterparts in the active. Parallel results 
were obtained by Turner and Rommetveit 
(1968) in a study with children using pic- 
ture cues, rather than word cues. They found 
that pictures of logical objects were more 
effective prompts for the recall of passive 
sentences and pictures of logical subjects 
better prompts ‘for the recall of active sen- 
tences. In this study, too, the difference be- 
tween the two types of cues was proportion- 
ately larger for the passive voice than for the 
active voice. These studies extend the pre- 
vious Tannenbaum and Williams findings in 
showing that individuals are inclined to begin 
a sentence with what is contextually promi- 
nent (here the cueing noun phrase or picture) 
and that the passive is more sensitive to con- 
textual variation than the active, Using an 
entirely different method, Prentice (1966) 
obtained results supporting both conclusions. 
She found that for both actives and Passives 
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it was easier Lo learn to associate a sentence 
to an extrasentential noun when the first noun 
of the sentence (the grammatical subject) 
was a high associate of that noun than when 
the last noun of the sentence (the grammati- 
cal object) was a high associate. But the 
difference in ease of learning between the two 
conditions was much larger for the passive 
than for the active. The tendency for con- 
textual continuity seems also to be reflected 
in Wright’s (1969) finding that the recall 
of a sentence 5 seconds after presentation 
was more accurate when the question about 
it was in the same voice as the initial sen- 
tence than when the sentences and questions 
were mismatched. The asymmetry between 
the active and the passive is also reflected in 
this experiment. The detrimental effect of a 
passive question about an active sentence was 
considerably larger than the detrimental effect 
of an active question about a passive sentence. 

The experiments reviewed so far show a 
tendency for congruence between the gram- 
matical subject and the preceding context, 
thus suggesting that the grammatical sub- 
ject is selected to reflect what is contextually 
prominent. The studies to be discussed below 
attempt to assess directly the emphatic func- 
tions of the grammatical subject. These ex- 
periments also indicate that the passive is the 
one that is particularly adaptive to the con- 
text; the active is normally available for use 
unless contextual weight suggests the choice 
of the passive. Two studies by Johnson-Laird 
(1968a, 1968b) were guided by the notion 
that the function of the passive is to make 
the logical object more prominent by placing 
it in the position of the grammatical subject. 


His sentences had the form Blue follows 


(or 
precedes) red and Re 


d is followed (or pre- 
ceded) by blue, 'The subjects' task was basi- 
cally to find or express correspondences be- 
tween these sentences and colored strips. The 
findings indicate that the grammatical subject 
(in above example: blue for first sentence 
and red for second) was assigned a larger 
area in the strip than the grammatical ob- 
ject (red and blue, respectively, for first and 
second sentence). Also, the difference between 
Size of subject and object was larger for the 
Passive than for the active, reflecting again 
the marked status of the passive. These find- 


ings support the idea that the function of the 
passive is to emphasize the logical object by 
making it the grammatical subject. However, 
the generalizations that can be reached from 
this study are rather limited because only 
two nouns and verbs were used (and atypical 
ones at that) and because of the suggestive 
nature of the task. 

Clark (1965), working within the informa- 
tion theoretic framework, has pointed to the 
higher interest level of the grammatical sub- 
ject (the logical object of the passive) than 


of the grammatical object. In particular, he 
found: 


The beginning of both grammatical forms tends t9 
be more animate and to have less uncertainty than 
the end. . . . In ordinary terms, people put what 
they want to talk about (usually animate things 
of little uncertainty) in the beginning of the sen- 
tence [Clark, 1965, p. 369]. 
Johnson (1967) seems to have provided addi- 
tional support for Clark's conclusion. . 
The psychological studies have thus vali- 
dated the two functions of the agentful pas 
sive, intuitively postulated by linguists: (a) 
to provide discourse continuity and (0) t° 
impart prominence to the logical object БУ 
placing it in grammatical subject position. 
These intuitions and findings have to be 
reconciled with the suggestion made by Mi- 
hailovié (1963) that the logical subject, n0t 
object, is really the emphasized one in pip 
passive because it receives heavy stress: 
“When the actor is shifted to final position: 
primary stress falls on it, and makes it Ven 
prominent [p. 81].” This suggests that ther? 
may be different senses in which the logic? 
object and the logical subject are promine" 
in the passive voice. The prominence of the 
logical object has been identified via the 1Ê 
viewed studies in terms of contextual rela- 
tions. The logical object is shifted in th? 
passive to the position of grammatical дш 
ject because it represents the dominant inter 
est of the discourse or because it provide, 
for discourse continuity, In what sense is jn 
logical subject prominent in the passive? j » 
answer to this question will be attempted ? 
the next section through an explication ° 
the concept of presupposition. 
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Aclive-Passive Semantic Differences: On 
Presup position 

The notion of presupposition has some 
background in Philosophy (see Garner, 1971) 
and in linguistics (see Halliday, 1967) but 
has recently been elevated to a position of 
visibility and importance through the work 
of generative linguists. Presupposition occu- 
pied a central position in a recent linguistic 
conference (Fillmore & Langendoen, 1971) 
and it is involved in the controversy whether 
aspects of meaning not accommodated by the 
deep structure are a Property of the surface 
Structure (Chomsky, 1970) or whether all 
aspects of meaning are concentrated in a 
semantic base, bypassing the deep structure 
altogether (Lakoff, 1971; see also Chafe, 
1971). Our application of presupposition to 
the explication of semantic differences be- 
tween the two voices can, however, be pur- 
sued in a neutral manner. In what follows we 
Will first review the essential aspects of the 
Concept and then turn to its application to 
the semantic differentiation of the two voices. 

The notion of presupposition entails a dis- 
tinction between two aspects of sentential in- 
formation: the explicit, new, or illocutionary 
and the implicit, given, or presupposed, In 
philosophy, the interest centered on the pre- 
Supposed existence of definite names involved 
in sentences. Thus, in Sentence 18 the present 
existence of a king of France is presupposed 
as given and his baldness asserted as new 
information. 


18. The present king of France is bald, 


Philosophers investigated the effects of fail- 
ure of presupposition on the status of sen- 
tences implying them (see Garner, 1971; 
Hutchinson, 1971). This issue need not con- 
cern us here and we can turn directly to the 
Work of linguists on presupposition. Fillmore 
(1971a, 1971b) has presented insightful 
analyses of the semantic composition of lexi- 
cal items in terms of presupposition. Consider, 
for instance, Sentence 19. 


19. Please shut the door. 


Because of the verb sut, the sentence implies 

the Presupposition that the door is open. The 

Same Presupposition is implied by Sentence 
0, which is negative. 


20. Please don't shut the door. 


Or compare the verbs accuse and criticizc in 
Sentences 21 and 22, respectively, 


21. John accused Harry of writing the letter, 
22. John criticized Harry for writing the 
letter. 


Both sentences convey the same information, 
that John thought Harry did something repre- 
hensible by writing the letter. But whereas 
Sentence 21 presupposes that writing the 
letter was blameworthy and asserts the asso- 
ciation (in John's Statement) of Harry with 
this act, Sentence 22 presupposes that Harry 
wrote the letter and asserts the negative 
evaluation of this act. Thus, what is pre- 
supposed as given in Sentence 21 is asserted 
as new in Sentence 22 and vice versa. 

Other lexical items, apart from verbs, 
also carry presuppositional information (see 
Keenan, 1971), but our main interest here 
is in the analysis of Presuppositions carried 
by sentential structure. Chomsky (1970) has 
based his Presuppositional analysis of syntac- 
tic structures mainly on the examination of 
questions. He compares, for instance, Sen- 
tences 23 and 24. 


23. Did John give the book to Bill? 
24. Did John give Bill the book? 


Under normal intonation, the last word in 
each of these sentences receives major pho- 
netic stress, Chomsky Proposes that the 
phrase containing the intonation center be 
considered the assertional focus of the sen- 
tence and the Presupposition of the sentence 
be defined as the Proposition resulting from 
the replacement of the focus by a variable. 
Thus, the Presupposition of Sentence 23 
would be that John gave the book to someone 
and that of Sentence 24 would be that John 
gave Bill something. The postulated differ- 
ence in the presuppositional structure of the 
two sentences is reflected in the difference in 
status of Responses 25 and 26. 


25. No, to someone else. 
26. No, something else. 


Response 25 appears more natural for Sen- 
tence 23 and Response 26 More natural for 
Sentence 24 because in this Pairing the re- 
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sponses share the presuppositions of the ques- 
"ons: But 26 as a response to Sentence 23 
and 25 as a response to Sentence 24 appear 
less natural because they deny the presupposi- 
tions of their respective questions. Contras- 
tive intonation can change the focus and pre- 
supposition of a sentence. Thus, 26 is a nat- 
ural response to Sentence 23 when book is 
heavily stressed in Sentence 23. However, 
the same principles hold for contrastive in- 
tonation as for normal intonation and our 
discussion will evolve around the latter case. 
'The essence of Chomsky's proposal is that 
the phrase containing the most heavily 
stressed word is the focus or core of the new 
information asserted by the sentence; the re- 
maining information in the sentential frame 
is considered to be presupposed as given. 

Halliday (1967-1968), although working 
from a different perspective, has discussed the 
notion of presupposition in much the same 
terms as Chomsky (1970), but with an im- 
portant addition. He points out that every 
syntactic structure contains new information 
but the presupposition (the “given” informa- 
tion, in his terminology) is optional for un- 
marked structures. That is, in unmarked 
cases, the information focus may extend over 
the entire sentence. 

With this background on presupposition 
we can turn to an examination of presupposi- 
tional differences between the active voice and 


the passive voice. Consider a passive sentence 
as in 27 below. 


27. The principal had been angered by the 
parent. 


Primary stress is assigned to parent making 
the phrase the parent the focus of the asser- 
tion. The presupposition, then, is that the 
principal had been angered by someone. In 
other words, this sentence presupposes as 
known or given that the principal had been 
angered, and it provides the new information 
that it was the parent who was responsib!e 
for this. This distribution of focus-presupposi- 
tion as inferred from the stress pattern is 
supported by a consideration of the choices 
the speaker could have made, but did not 
make, in expressing his message. If he wanted 
to impart just the information about the 
anger of the principal he could have used the 
more common agentless passive, saying: The 
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principal had been angered. By adding the 
by-agent, the speaker has focused attention 
on it, making it the core of the assertion. — 

What about the active voice? On the basis 
of stress consideration alone, the active, like 
the passive, divides into an assertional and 
presuppositional component. Thus, consider 
Sentence 28, the active counterpart of Sen- 
tence 27, as an example. 


28. The parent had angered the principal. 


Primary stress makes the principal the focus 
of the assertion, and its replacement by a 
variable leaves the presupposition that the 
parent had angered someone. By this analy- 
sis, the noun phrase that enters into the 
assertion in the passive is part of the presup- 
position in the active, and vice versa. How- 
ever, we saw that the semantic differentiation 
between the two voices with respect to the 
variables of stylistic continuity and emphasis 
was mainly due to the effects of these varl- 
ables in the passive, with the active more OF 
less serving as the neutral base, in accordance 
with its unmarked status. The markedness 
distinction would suggest that the passive 
should also carry the main presuppositional 
load, with the active being more or less pre 
suppositionally neutral. In fact, this is exactly 
what Halliday has proposed. He has noted 
specifically that the presuppositional inference 
is optional in the active voice (Halliday, 
1967, p. 215). He made the point that sen- 
tences like 29 below can be analyzed either 
as presupposing that John's seen something 
and asserting that it was the play or as lack- 
ing any presupposition, with the entire sen- 
tence functioning as an assertion. 


29. John's seen the play. 


The active thus offers the speaker a neutral 
form for conveying information, one he сап 
use when he does not want to emphasize oF 
to presuppose anything in particular. sat 
There has been virtually no psychologic# 
experimentation on presupposition, but 016 
of its psychological consequences is fairly 
obvious. Chomsky's (1970) discussion 16; 
viewed above suggests that it is *unnatura 
to deny presuppositions. This point underlies 


я ы ; 3 ike 
the interviewing effectiveness of questions lik 
30 below. 


30. When did you stop beating your wife? 


р. 
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The verb stop asserts the cessation of an 
activity; it presupposes its prior existence. 
Or take the noun bachelor. Its assertional or 
foreground meaning refers to a person who 
has never been married, its presuppositional 
or background meaning concerns an adult 
male. Fillmore (1971a) has pointed out that 
the predication in Sentence 31 involves the 
assertional meaning of bachelor, not its pre- 
Suppositional content, 


31. That person is not a bachelor. 


There is also experimental evidence that 
individuals are less likely to notice events 
inconsistent with a presupposition than they 
are to notice events inconsistent with an 
assertion. Hornby (1971) presented subjects 
with cleft and pseudocleft sentences as in 
32-35, 


32. It is the girl that is kicking the pumpkin. 

33. The one that is kicking the pumpkin is 
the girl. 

34. It is the pumpkin the girl is kicking. 

35. What the girl is kicking is the pumpkin. 


Both Sentences 32 and 33 presuppose that 
Someone is kicking the pumpkin and assert 
that that someone is the girl, whereas Sen- 
tences 34 and 35 presuppose that the girl is 
kicking Something and assert that that some- 
thing is the pumpkin, Following the presenta- 
tion of each sentence, Hornby flashed briefly, 
in a tachistoscope, a picture misrepresenting 
either the presupposition or the assertion of 
the sentence and requested the subjects to 
indicate whether the picture was a true or 
false depiction of the sentence. He found that 
Subjects gave more erroneous “true” responses 
When the picture misrepresented the presup- 
Position than when it misrepresented the 
assertion, Thus, for instance, a picture of a 
girl kicking a football elicited more true re- 
5ропзез for Sentences 32 and 33 than for 
Sentences 34 and 35. 

The implication of these observations and 
findings is that information presented on the 
Presuppositional level is more readily accepted 
and less likely to be questioned than asser- 
Чопа] information. In making this generaliza- 
Поп as well as any other ones about presup- 
Position, it must be kept in mind that not 
a Presuppositions are alike. In particular, 

Appears that there is a scale of strength of 


presuppositional implication. The presupposi- 
tions of some sentential Structures appear to 
be less definite than those of others (see 
Chomsky, 1970, Footnote 27; Geis & Zwicky, 
1971; Karttunen, 1971). Thus, for instance, 
the presupposition that somebody is kicking 
the pumpkin appears stronger in Sentence 32 
above than in the corresponding passive sen- 
tence, 36. 


36. The pumpkin is kicked by the girl. 


Perhaps the presuppositional message of the 
passive is weaker because it could have been 
chosen by the speaker to serve the functions 
of stylistic continuity and emphasis rather 
than the presuppositional function, 
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The log-linear model is described as a framework for analyzing effects in multi- 


dimensional contingency tables, that is, tabl 
variables of classification. V 


general purpose analysis is proposed for suc 


es of frequencies formed by two or more 


ariables are considered to have nominal categories. A 


h tables, similar to analysis of variance. 


Two test procedures are considered: (а) maximum likelihood estimation of expected 
cell frequencies and associated chi-square tests and (b) chi-square tests based on 


logarithms of adjusted cell frequencies. 
methods, related to the latter approach, ar 
native procedures. 


Since 1935, when the first article dealing 
with the estimation of interaction in multidi- 
mensional contingency tables was published 
(Bartlett, 1935), a large literature has accumu- 
lated concerning the problems of defining and 
testing complex relationships in such tables. 
A previous article in this journal (Sutcliffe, 
1957) presented the methods developed by 
Lancaster (1951) based on the partitioning of 
а total chi-square statistic in a way analogous 
lo the partitioning of a total sum of squares in 
the analysis of variance. However, the majority 
of statisticians working in this area have re- 
jected these methods on the grounds that when 
relationships among variables are defined in an 
intuitively acceptable manner, the partitioned 
chi-square values do not correspond to tests of 
these relationships (see, e.g., Goodman, 1964b). 

The purposes of this article are (a) to discuss 
definitions of relationships among variables in 
Contingency tables and (b) to describe prac- 
tical methods of testing hypotheses about the 
existence of effects of different degrees of com- 
plexity, The article is limited to the considera- 
tion of variables with nominal categories (i.e., 
Unordered categories), or where order of 
categories is to be ignored;? and it is con- 
cerned primarily with significance testing 
rather than estimation. 


sp Requests for reprints should be sent to Juliet Toppe 
ац Department of Psychology, University 

as, Lawrence, Kansas 66044. ар 
Vari ate that this implies essentially no restriction lor 
‘ables with two categories. 
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In addition, two multiple-comparison 
e considered as supplementary or alter- 


THEORETICAL DEVELOPMEN 


In a contingency table, observed entities are 
classified on the basis of a number of variables, 
each of which consists of a set of mutually ex- 
clusive and exhaustive categories. Each com- 
bination of possible variable values defines a 
cell of the table, and the entry in each cell 
equals the number of entities exhibiting that 
particular combination of values. In general, 
with m variables, the contingency table can be 
represented as an m-dimensional rectangle with 
cubical cells. 

In the literature, related but not identical 
distinctions have been made between variables 
with fixed and random marginal distributions, 
between independent and dependent variables, 
and among variables with different causal 
relationships to one another. These distinctions 
are considered as they become relevant in the 
course of the presentation. They affect the 
nature of the analysis considered here in that 
they bear on the appropriateness of testing 
various kinds of relationships, but the form of 
any test, if appropriate, is the same for all cases. 

Note that a g-factor analysis of variance de- 
sign would correspond to a (q + 1)-factor con- 
tingency table, since reference to the dependent 
variable is suppressed in analysis of variance, 
where all effects can be thought of as inter- 
actions between the д independent variables 
and the one dependent variable. Since all 
variables are represented explicitly in the con- 
tingency table, the frequency in each cell can- 
not be considered equivalent to a score; that is, 
it is not analogous to a cell mean in the analysis 
of variance that represents the mean score on 


( 
| 
\ 
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the dependent variable. Nonetheless, it is 
demonstrated that treating transformed cell 
frequencies as if they were scores produces a 
formal correspondence between contingency 
table models and analysis of variance models. 


Notation 


The variables in the contingency table are 
denoted A, B,C, ..., with aparticular category 
of A denoted A; where i = 1,..., a; of B de- 
noted Ву, where j = 1, ..., b; etc. Further 
notation is defined for a three-factor table; the 
extension to tables of other dimensionality 
should be obvious. The probability that an 
entity appears in the cell characterized by level 
i of A, level j of B, and level k of C is denoted 
pin, With У 27 Y рок = 1.4 Tables formed by 

+ J d 


summing фу over all categories of one or more 
variables are called marginal tables. The vari- 
ables summed over are represented by dots; 
thus, the entires in the АВ marginal table are 
the values of p;;., those in the A marginal table, 
the values of p;.., etc. These entries could be 
probabilities, fixed proportions, or some com- 
bination of the two, depending upon the 
method of controlling the variables in the ex- 
perimental situation. For example, if A repre- 
sented sex, and equal numbers of men and 
women were chosen for an investigation, the 
Pi. where i = 1, 2, would be fixed proportions, 
each equal to 3. On the other hand, if the total 
sample were chosen randomly from a popula- 
tion and sex were merely recorded, the pi.. 
would be probabilities. If the sample were con- 
trolled on other characteristics but not on sex, 
the р. would be averages of probabilities within 
subpopulations. The observed frequency in the 
cell characterized by level i of A, level j of B, 
and level k of C is denoted Nijx, with the total 
frequency, x r2 Ni, denoted by N. As- 
1 


suming a fixed value of N, the expected fre- 


2 Logit analysis and its extensions, which deal with 
an m-dimensional contingency table where one variable 
is considered the dependent variable and the other 
m — 1 are considered independent variables, are not 
considered separately here, since these analyses can be 
shown to be special cases of the general analysis that is 
presented (Bishop, 1969). 

* Summations and products will always be over the 
full range of possible values of the variables. Therefore, 


for Convenience, the range is not indicated in the sum- 
mation or product notation. 


quency in each cell, V piji i5 denoted Fi The 
dot notation used for probabilities is used 
similarly. for observed and expected. frequen- 
cies; thus, fj = x Rin, Ni. = йз р> Ад 
Л jk 
etc. Maximum likelihood estimates of the es- 
pected frequencies in each cell are denoted 
M ij, with the dot notation used as for № and 
E. Any term enclosed in parentheses refers to 
the set of such values in the entire table; for 
example, (№) denotes the complete set of 
marginal А В frequencies.? 


Fixed and Random Marginal Tables 


All methods presented are appropriate for 
tables in which (a) marginal distributions on all 
subsets of variables are random and (p) the 
marginal distribution of some subset of from 1 
to m — 1 variables is fixed. (In the latter case, 
all marginal distributions of variables con- 
tained in the subset are also necessarily fixed.) 
Test procedures are suitable for testing rela- 
tionships among variables not included in the 
subset with a fixed marginal distribution, and 
for testing relationships among variables of 
which some are and some are not in that sub- 
set. In general, they are not appropriate for 
testing relationships among variables which are 
all in the subset with a 


margina 
distribution. 


fixed 

Although many of the procedures are suit- 
able under more general conditions on margina 
tables, Condition a or b (set forth above) should 
be satisfied in a large majority of empirica 
studies. 


Log-Linear Model 


The log-linear model, with properties inves- 
tigated extensively by Birch (1963), Bishop 
(1969), and Haberman (1970), provides a satis” 
factory framework for a general analysis 0 
contingency tables. In. this model, the loga- 
rithm of a cell probability is expressed as the 
sum of a number of effects in an expression 
formally identical to that for the population 
value of a cell mean in an analysis of variance 
design. For example, in a three-factor design» 
the log-linear model can be written in the fot 


: — uu xi- 

Ext is assumed implicitly throughout that all m 
mum likelihood estimates and expected cell frequencit 
are greater than zero. 


| 
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lowing form: 


log Pain = o A vii ve; F vus F тз 
+ virt Toajk + СОЕТ [1] 


wherei = 1,...,a; = 1... 0; E 1, PARN 

c; with linear restrictions on the parameters 

identical to those in analysis of variance; 

for example, 3 i; = 3; т; = Ут = 0, etc. 
- - = 


t J 

Each hypothesis then can be expressed by set- 
ling the appropriate terms equal to zero; for 
example, the hypothesis A X B X C = 0 is 
equivalent to the hypothesis vz; = 0, for 
P—1,...,0:491,...,0; d = 1, 

In order to clarify the nature of the model, it 
is discussed first as it applies to contingency 
tables where each factor has two categories. 


#& 
asg 


Dichotomous Variables 


Two factors: The 2 X 2 table. In a 2X2 
table, the effects that can be considered are A, 
B, and A X B. Taking the latter first, the 
hypothesis that the interaction effect equal 
to zero in the log-linear model is equivalent 
to the hypothesis of independence of the two 
factors; that is, that фуу = pip. where i, 
J = 1, 2. The main effect of variable А can 
be defined either marginally or conditionally. 
The marginal main effect would be a function of 
the marginal proportions, and the hypothesis of 
Zero marginal main effect could be stated as 
ollows: Pi. = }, where i = 1, 2. This obviously 
Could be meaningful only if p;, were not a fixed 
Proportion, for any i. Alternatively, the main 
effect of 4 could be defined as some type of 
average of its effects within each level of В, 
Which are referred to as partial effects. The 
Зуро(ћеіѕ that а particular effect equals zero 
In the log-linear model for the complete table 
™plies an average partial definition of the 
fect. A definition of partial А effects consis- 
tent with the model is log (pi;/ps), although 
апу function of (p1;/p2;) would be sufficient. 

1€ average partial effect of A equals zero in 
the model if and only if È log (р/р) = 0. 
Three factors: The 2 X2 X2 table. Main 
“cls in the 2 X 2 x 2 table are defined in the 
“Me Way ac in the 2 X 2 table. With three 
Setos, cach main effect can be defined margin- 


eff, 
Sar 


ү Е 5 
ron MM anv base for pur- 
Doseg * logarithms can be taken to any base T 


Of the discussion in this section. 
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ally with respect to the two other factors, par- 
tially with respect to both, or marginally with 
respect to one and partially with respect to the 
other. The three two-variable interactions can 
each be of two types: marginal with respect to 
the third variable, or an average of the two 
partial interactions, each at one of the values of 
the third variable. The hypothseis of zero mar- 
ginal interaction between 4 and B can be ex- 
pressed in the following form: Pu = Pipis 
where 7, j = 1, 2. In order to define a zero 
average partial interaction, a measure of two- 
variable interaction and a specification of the 
averaging process is needed. The hypothesis of 
zero three-variable interaction, as in analysis 
of variance, is taken to be equivalent to the 
hypothesis that the two partial two-variable 
interactions are equal, for every pair of the 
three variables. 

To be consistent with the log-linear model, 
it is sufficient to define the measure of two- 
variable interaction as a function of the cross- 
product ratio piipos/ psp» in a 2 X 2 table, or 
Diicfosi/ Pis Pais for each of the partial inter- 
actions in a 2 X 2 X 2 table. Edwards (1963), 
Altham (1970), and Mosteller (1968) have 
presented compelling arguments for using a 
measure of association based on the cross- 
product ratio. The hypothesis that A X B = 0 
in the log-linear model is equivalent to the hy- 
pothesis that x log (Pupesr/pisrpeu) = 0. 


The hypothesis that the triple interaction is 
zero is equivalent to the hypothesis that the 
two partial interactions are equal or that 
(фапфәоз/ Pipa) = (рззрәзз/ Prepay). This 
definition of zero A X B X C interaction is 
equivalent to that given by Bartlett (1935), 
and has been defended from an information- 
theoretic point of view by Good (1963)? and 
from a Bayesian point of view by Lindley 
(1964). 

More than three dimensions: The general 2" 
table. The extension to more than three dimen- 
sions is straightforward. In a four-dimensional 
table, for example, the hypothesis of zero 
A X B X C X D interaction is the hypothesis 
that the two partial three-factor interactions 


7 Kullback and his coworkers, using information- 
theoretic concepts, arrive at a related but somewhat 
different general approach (see, e.g., Ku, Varner, & 
Kullback, 1971; see also Goodman, 1969), 1970, 
1971a, for discussion and criticism). 
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are equal for all combinations of three of the 
four variables. The three-factor interactions are 
measured as functions of an extended cross- 
product ratio; thus, the measure of three- 
variable interaction in a 2 X 2 X 2 table 
would be a function of (piifsufis / 
(Pinfonifusfss), and the hypothesis of zero 
four-factor interaction in a 2 X2 X2 X2 
table could be expressed as follows: 


(Prnp22uPr221p2 2 


General m-Dimensional Contingency Table: Fx- 
lension of Definitions 


In analysis of variance factorial designs, an 
A X B X C interaction is defined to be zero 
when it is zero for every combination of two 
levels of А, two levels of B, and two levels of 
C; an А X B interaction is defined to be zero 
when it is zero for every combination of two 
levels of А and two levels of B, etc. The same 
definitions apply in the contingency table. It is 
not necessary to examine all possible subtables 
in each case. A subset of subtables can be 
chosen such that if the required relationships 
hold within the subset, they hold for the set of 
all relevant subtables for each effect. A con- 
venient way of choosing a subset is to consider 
only combinations of two levels with adjacent 
numerical values. For the 1 X В X C inter- 
action in the three-factor table, then, it would 
be sufficient to consider the (a — 1)(b — 1) 
(c — 1) subtables formed by taking all com- 
binations of levels i and (i + 1) of A, where 
i21,...,0— 1; j and (j +1) of 2, j — 1, 
22, 0 — 1; and k and (k + 1) of C, where 
k=1,...,¢ — 1. For example, ina 2 x 2x3 
table with variables 4, В, and C, three 
2X 2X 2 subtables can be formed: (a) Ap, 
Ву», Cro; (b) Av, Bio, Cis; and (с) Ais, Bia, Cos, 
where the subscripts indicate the levels of the 
variable that are present in the table. However 
it would be sufficient to consider only the Sub- 
tables а and c. The definition of zero 
А X В XC interaction therefore can be ex- 
pressed as follows: 


| 
EX [PaaspPosnGenaan/ 
Pier asnPorojgan] 


for pex. 22 ps b—1: 


PS Ty уз 
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k= 1, ..., e — 1. Similarly, the 4 X B inter- 


action is defined as zero when 


log [pisePcirnusae/ розра) = 0. 


Relationship between Average Partial and Mar- 
ginal Effects 


Analysis of a marginal effect, using marginal 
contingency tables, should be considered only 
if the variables to be disregarded are randomly 
sampled; otherwise, the marginal table would 
not contain frequencies each of which could be 
regarded as s sampled from a single population. 
Although the analysis of variance analogues of 
average partial and marginal effects are always 
equal in a balanced design, the effects are gen- 
erally not equal in a contingency table, and the 
relationship between them is rather complex. 
Therefore, it is of some importance to have 
criteria for deciding which effects should be 
examined. This issue is considered in the con- 
text of a three-dimensional contingency table, 
with Factors -1, B, and C, where the A X B 
interaction is of interest. 

An analogy between average partial A X B 
interaction and partial correlation (рлв,с) and 
marginal 44 X B interaction and marginal cor- 
relation (pag) is helpful in understanding the 
issues in this decision. As in the correlational 
situation, it seems necessary to consider the 
meanings of the variables, and in particular, 
their possible causal interrelationships. Where 
there is no possibility that C could be an effect 
of A or B, the average partial 1 X B interac- 
tion would generally be more meaningful. Thus, 
if C were an independent variable in the ex- 
perimental sense, a set of experimentally 
manipulated conditions, then the average par- 
tial A X B interaction would have a clear in- 
terpretation. Similarly, if C were a variable, 
such as sex, race, or social class (for children), 
and if the interaction were between two be- 
havioral measures, the partial interaction 
would be indicated, since these background 
variables could in no way be functions of the 
behavior. 

On the other hand, if C were a variable that 
could itself be affected by -4 or B, such as 
another behavioral measure, holding C ete 
stant could distort the estimated relationship 
between A and B, as in the case of partial cor- 
relation. Simpson (1951) discussed some inter- 
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TABLE 1 


LoG-LiNEAR EXPRESSIONS OF HYPOTHESES IN A THREE-FACTOR CONTINGENCY TABLE 


Expression in log-linear model 
Hypothesis | 
| Equations af 
4 = 01 Y Y log pi — У log purje = 0 diasd 
1 k i 
B= 0| EE loin- У У log Pigans = 0 b—1 
* Ж d 
c=0 У Y log pij; — У X log pg = 0 є— 1 
Y 3 iy 
AB = 0| X log pi, + Y log pusngine — ¥ log Ру — X log perje = 0 (a — 1) (5 — 1) 
k k k k 
AC — 0| Y log pus 4- Y log Passa — X log рады) = У log boi = 0 (a = 1)(c 1) 
4 J i 3 
BC = 0| Y log pi, + log Pigs — X log Pausan — X log Pigans = 0 (b — 1)(с— 1) 
ABC = 0| log pc + log ралуу + log Psion + log pasngane — log Pausa (a — 1) (b — 1) (c — 1) 
— log pigne — log Proje — log pusoginusn = Û 


Note.—When equations require summing over i 
tions do not require summing over indices, their ran 


esting examples illustrating the issues just con- 
Sidered. In many empirical situations the 
choice between the analysis of average partial 
and marginal effects may not be clear-cut, and 
often both may be of interest. 

The conditions under which average partial 
and marginal interactions are identical are 
difficult to specify in general, but relatively 
easy to specify for two-factor interaction in 
a three-factor table. In particular, a zero aver- 
age partial A X B interaction implies zero 
Marginal 4| X B interaction if and only if 
either (a) A X B X € = 0, and A X C (aver- 
age partial) = 0; or (b) LX B X C — 0, and 
B X C (average partial) = 0. For a detailed 
discussion of the relationship between partial 
and marginal effects in general contingency 
tables, see Goodman (1970, 1971b). 


Functions of Cell Probabilities in the Log-Linear 
Model 


Just as in the case of population cell means 
in analysis of variance, hypotheses that speci- 
lied effects are zero can be expressed in terms 
Of constraints on cell probabilities in the log- 
inear model. ‘Table 1 gives the contingency 
lable model equations corresponding to each 
ypothesis which could be tested in a three- 
factor contingency table. HE log ред, log ра, 
Cle, are replaced by дал, ign, etc., where 
“iin represents the population mean of the de- 


pendent variable in cell 4 i; Bj, Cr in a factorial 
analvsis of variance design, all expressions 
would be appropriate to the factorial design. 
То extend the results to general contingency 
tables, а simple rule for generating the required 
equations is stated, First, consider the 2” sub- 
table consisting of Levels 1 and 2 for each fac- 
tor. Given any specific effect, the subscripts of 
b (or u) referring to variables represented in 
that effect will be designated relevant to that 
effect. For example, subscripts i and j are 
relevant to effect A X B; i and Ё are relevant 
to effect AX C; i, J, and Ё are relevant to 
A X B X C; etc. Take the sum of the log pije 
(or uj) over all the irrelevant subscripts. To 
decide whether each of these sums should be 
added or subtracted in the general expression, 
add the values of the relevant subscripts of each 
to obtain the relevant sum. If the effect is an 
interaction among an even number of variables, 
put a plus sign in front of each term with an 
even relevant sum and a minus sign in front 
of each term with an odd relevant sum. If the 
effect is an interaction among an odd number 
of variables, put a plus sign in front of each 
term with an odd relevant sum and a minus 
sign in front of each term with an even rele- 
vant sum. Set the expression thus generated 
equal to zero. For example, in the equation for 
A X B = 0 in Table 1, the relevant subscript 
sums of successive terms are 2, 4, 3, and 3, 
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respectively—the first two even, and the last 
two odd. 'The set of equations appropriate to 
all levels of the factors then can be obtained 
by replacing the relevant subscripts by i if 
equal to 1, or (i + 1) if equal to 2; 7 if equal to 
1, or (j + 1) if equal to 2; etc., according to 
the positions of the subscripts. 

The degrees of freedom for the test of each 
hypothesis using the techniques to be presented 
are the same as for the test of the equivalent 
analysis of variance hypothesis. 


PROCEDURES 


Two large-sample procedures for testing the 
significance of effects are presented: chi-square 
tests based on maximum likelihood estimation 
of expected cell frequencies and chi-square tests 
based on logarithms of adjusted observed fre- 
quencies. The sample sizes required for their 
appropriate use remain to be determined by 
future research. Corresponding exact small- 
sample tests, illustrated using the contingency 
table in Table 3, are presented in Shaffer 
(1972). 

In order to provide the rationale for these 
tests, it is convenient to begin by considering 
the distinction between hierarchical and non- 
hierarchical hypotheses. 


Hierarchical and Nonhierarchical Hypotheses 


In analysis of variance designs, each main 
effect and interaction effect is generally tested 
individually for significance. Another possibil- 
ity is to test simultaneously the hypotheses 
that a particular effect or set of effects and 
all higher-order interactions involving the same 
sets of variables are zero. This test is sometimes 
recommended in a multifactor analysis of vari- 
ance design where individual higher-order in- 
teractions are not of interest, or where one 
wishes to avoid finding chance significant 
higher-order effects Simply as a result of the 
large number of tests otherwise performed (see 
Anderson, 1968). A hierarchical hypothesis is 
defined as a hypothesis that a particular effect 
and all higher-order interaction effects in which 
the same combination of variables appears are 
zero; for example, in a four-factor table, the 
hierarchical hypothesis AX 8 —0 in- 
cludes the individual hypotheses 4 X B = 0, 
AXBXC=0, AXBXD=0, 


and 
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FXBXCXD-0, implying, therefore, 
that all partial A X B interactions, as well as 
their average, are equal to zero or, in other 
words, that A and В are independent at all 
levels of C and D. Another hypothesis of pos- 
sible interest in a four-factor problem is the 
hypothesis that all interactions of three or more 
factors equal zero. To test this would require 
a simultaneous test of the hypotheses 
AR В Съо, XXBDxD- AXC 
XD=0,BXCXD=0, and AXBXC 
X D = 0. These form a hierarchical set; for 
ct equated to zero, all higher-order 
tions involving the same combination of 
variables are also equated to zero. 

While each chi-square procedure can be used 
to test both hierarchical and nonhicrarchical 
hypotheses, the method based on expected cell 
frequencies is more easily adapted to testing 
hierarchical hypotheses, while the method 
based on logarithms of observed cell frequencies 
is relatively simpler in testing nonhierarchical 
hypotheses. Therefore, each method is рге 
sented in the context in which it is most use 
fully applied. 


Hierarchical Hypotheses: Approximate Chi- 
Square Tests Based on Maximum Likelihood 
Estimation of Expected Cell Frequencies 


The chi-square test of a hierarchical hypothe- 
sis (or set of hierarchical hypotheses) pre- 
sented here utilizes a maximum likelihood est 
mate of the expected frequency in each cell 
given that the hypothesis is true. The hypothe- 
sis can then be tested using an ordinary Pearson 
chi-square test, which, in the context of a 
three-factor table, for example, involves cont 
putation of the following: 


" , same 
An alternative chi-square test of the yo 
hypothesis, based on the logarithm of the > i 
lihood ratio, requires the computation of 
following: 
" Мек 3 
х=? У у: 5 Na wou (s^ . p 
€ $ Е Mi 

s n ice Пу 
The two chi-square tests are asymptotlca 2. 
equivalent (Mood & Graybill, 1963). There |: 
no clear evidence that either has an advanta? 


у 
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over the other for a moderate number of ob- 
servations. The less familiar likelihood ratio 
chi-square test has been introduced. because 
tests of conditional hypotheses, described in 
à subsequent section, are more readily handled 
by the use of these chi-square tests. 

The extension of both tests to tables with 
more than three factors should be obvious. The 
number of degrees of freedom for chi-square 
equals the number of cells minus the number of 
linearly. independent restrictions employed in 
obtaining the estimated expected frequencies 
This number can be arrived at conveniently by 
treating the hypotheses as though they were 
the equivalent analysis of variance hypotheses 
and obtaining the degrees of freedom of the 
analysis of variance sum of squares that 
would be appropriate for the test of the com- 
posite hypothesis. Thus, in an ABC table, the 
hypothesis A X B X C = 0 would be tested 
approximately by a chi-square test with 
(a — 1) (b — 1) (c — 1) degrees of freedom; the 
hierarchical hypothesis 4 X В = 0, which 
would imply both xX B XC =0 and 
A X B = 0, would be tested by a chi-square 
test with (a — 1) (b — 1) (c — 1) + (a — 1) 
(b — 1) degrees of freedom, etc. Generally, the 
larger the set of individual hypotheses, the 
easier the determination of maximum likeli- 
hood estimates of expected frequencies. In 
Some but not all cases, explicit expressions for 
these estimates in terms of marginal frequen- 
cies can be obtained. 

The general procedure is illustrated first for 
à three-factor table. Table 2 presents every pos- 
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sible combination of hierarchical hypotheses in 
a three-factor table (except those that can be 
obtained by symmetry), the appropriate for- 
mulas for estimating expected frequencies, and 
the degrees of freedom for the chi-square 
test. In a three-factor table, the hypothesis 
A X BX C = 0 іѕ the only one that does not 
result in an explicit expression for estimates of 
expected cell frequencies. These estimates can 
be obtained in a straightforward manner only 
by solving a set of (a — 1) (b— 1) (c— 1) 
simultaneous fourth-degree equations (Good- 
man, 1964). 

As the number of v 


ariables increases, so does 
the number of hierarchical hypotheses that do 
not yield explicit expressions for maximum 
likelihood estimates of cell frequencies. Good- 
man referred to these as nonelementary, as 
opposed to elementary, hypotheses (Goodman, 
1970). As Table 2 indicates, only one of the 
cight types of hypotheses in a three-factor 
table is nonelementary, while in a four-factor 
table, 10 of the 27 types are nonelementary (see 
Goodman, 1970, p. 239, for a full list of hier- 
archical hypotheses in a four-factor table) 
the number of factors increases, the comple ty 
of straightforward estimation also increases. 
The adaptation, during the 1960s, of a rela- 
lively simple iterative method to obtain these 
expected frequencies marked a real break- 
through in providing practical methods for 
analvzing multidimensional tables, The method 
is based on the fact that when maximum like- 
lihood estimates of expected frequencies under 
a given hierarchical hypothesis are substituted 


TABLE 2 
HiRARCHICAL HYPOTHESES IN A TitREE-lACTOR TABLE 
Hierarchical hypotheses Effects equal to zero Expression for Mig |. df 
А =й A, AXB, AXC AXBXC | Na (a — tybe 
4=0, -0 А, В, AXB, AXC, ВХС, Ny ab (ab — Tir 
AXBXC Я | 
1 =Ü, Wie 0, € =0 A, B, C, A X B, AX C, BKC | Nabe abc — 1 
AXBXC | к 
4-0 BxC-0 A, A XB, ARC, A KERG Х.М.а abe — b + 1 
Bc | | 
AxXB=0 [2€ B, (a = Db = De 
АХВ=0,4хС=0 |4XB,. (a — 1)(be — 1) 
iX Ba Awe =0 IXB, [abc —a—h—e42 
BxC=0 IX BXC 
AXBXCLp HERL No esplicit на = ШИВ = ê — p 


expression 
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TABLE 3 


CONTINGENCY TABLE ror Test or Hype "THESIS 
AXBXC=0 


for observed frequencies, marginal tables based 
on these estimated expected frequencies are 
the same as those based on the observed fre- 
quencies for all combinations of variables cor- 
responding to effects that are not as 
be zero according to the hypothe 

The approach is illustrated first for the test 
of the hypothesis А X B X C = 0 in the con- 
tingency table given in Table 3. Its extension 
to the general case then is indicated. Fortu- 
nately, the method is particularly simple when 
applied to elementary hypotheses, so that it is 
practical to apply it to find estimated Írequen- 
cies under any hierarchical hypothesis without 
the necessity of first determining whether or 
not it is elementary. 

Since the hypothesis implies only that 
ARB Se C equals zero, the marginal fre- 
quencies 4B, АС, and ВС are fixed; that is, 
the expected frequencies, substituted for the 
observed frequencies, will yield the same AB, 
AC, and BC marginal tables as the observed 
frequencies. (The 4, B, and C marginal tables 
are fixed as a consequence of having fixed AB, 
AC, and BC marginal frequencies.) Write out 
an initial contingency table with 1.0 in every 
cell. A series of iterations is used to change these 
preliminary values into expected frequencies 
(see Table 4 for an illustration of the pro- 
cedure), Each cycle, in this example, consists 
of three steps, one corresponding to each set of 
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fixed marginal frequencies: 


1. Adjust 1 B frequencies: Multiply the num- 
ber in each cell with level i of 41 and level j of 
В by the fixed A;B; marginal frequency. di- 
vided by the sim of the current numbers in all 
these cells. If this is carried out for each com- 
bination of i and j values, the new cell num- 
bers will have the correct AB marginal 
frequencies. 

2. Adjust АС frequencies: Using the new 
cell numbers obtained in Step 1, multiply the 
number in every cell with level i of A and level 
k of C by the fixed 1€, marginal frequency, 
divided by the sum of the current numbers in 
all these cells. Do this for each combination of 
i and Ё values; this step yields new cell num- 
bers with the correct 4C marginal frequencies. 

3. Adjust BC frequencies: Using the cell 
numbers obtained in Step 2, multiply the num- 


TABLE 4 
X B XC = О FOR THE 
E IN TAB 


OBTAINING (Mija) Gr 


Category 


ot A 
|B 1 | B=2 | B 1 
Initial table 
= —— — 
1-21 1 | 1 1 | 1 | i | 
1=2 1 1 ї |1 A 
Cycle 1, Step 1: Adjust 1B frequencies _ 
t= 1 [3333 | 4.007 | 3.333 | 4.667 3.333 | 4.667 
1=2 | 2.607 | 1.000 | 2.667 | 1.000 | 2.667 | 1.00 
Cycle 1, Step 2: Adjust .1C frequencies _ 
- ; — 
4 = 1 | 1.667 | 2.333 | 4.167 | 5.833 4.107 m 
1 = 2 |4364 1.636 | 2.182 | 818 | 1.455 | 546 
| = —€— À 
Cycle 1, Step 3: Adjust BC frequencies А 
- ti m 
А =1 | 1.382} 2.039 | 3.282 | 7.016 5.030 | 3.058 
122 ЕЕ | 1718! 981| 2.070! 342 
Cell entries after 7 cycles: меч 
1 =1 | 975 | 3.027 | 2881 | 7.149 | 6.178 | 3.822 
й=2 „шл. 851 | 1.822 | .17ё 
x = 6.50, lT = 2, р = O4. 
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ber in every cell with level jof B and E of C by 
the fixed В,С, marginal frequency, divided by 
the sum of the current numbers in all these 
cells. If this is done for cach combination of j 
and & values, the new cell entries will have the 
correct BC marginal frequencies. 


Steps 1-3 constitute a single cycle in the 
iterative method. Tn general, each step results 
in cell entries satisfying the marginal restric- 
tions adjusted for in that step, but no longer 
satisfying marginal restrictions adjusted for in 
previous steps. If the original cell entries yield 
Proportions satisfying the hypothesis to be 
tested (this can be assured for hypotheses 
specifying zero effects by taking all initial cell 
entries equal), then, at every step in each cycle 
the obtained cell entries yield proportions satis- 
fying the hypothesis. 

If the same cycle of three steps is repeated 
successively on the cell entries obtained at the 
end of each preceding cycle, the cell entries ap- 
proach the estimated expected frequencies, 
given the hypothesis. At the end of each cycle, 
the marginal frequencies obtained from the cell 
entries are compared with the required mar- 
ginal frequencies, When the two sets match 
Within a desired degree of accu racy, the process 
is terminated, and the cell entries at that point 
dre taken as the estimated expected frequen- 
cies, Apparently, a small number of cycles are 
Sufficient to obtain a highly accurate approxi- 
mation: 5-7 cycles have been found sufficient 
in most cases (Ku, Varner, & Kullback, 1971). 
In the example illustrated in Table 4, seven 
Cycles were required to obtain agreement 
Within .1% between each obtained and re- 
quired marginal frequency. 

In the case of an elementary hypothesis, 
Agreement between obtained and required 
Marginal frequencies is perfect (within limits 
of computational accuracy) at the end of the 
first Cycle.5 Thus, the failure to recognize an 
elementary: hypothesis and to use the explicit 
formula does not lead to any substantially 
Steater computational effort. 

The set of marginal frequencies to be fixed, 
that is, to be adjusted for, in the test of any 

ierarchical hypothesis or set of hypotheses in 
general contingency table, can be determined 


* This has been proved to hold for up to seven vari 
‘bles. Beyond that, no case is known where more than 
Wo eveles йге necessary. (see Haberman, 1970) 


Ss 
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by following a procedure that is illustrated here 
for a particular example. Suppose, in a four- 
factor table, a test of the pair of hierarchical 
hypotheses 4 X В = 0 and BXCXD =0 
is desired. The full set of hypotheses is, then, 
AXB=0,A XBXC=0,A XBXD=0, 
A XBXCXD=0, BXCXD=0, The 
marginal tables to be fixed can be obtained as 
follows: the full set of effects is written out and 
all those assumed to be zero are eliminated 
(indicated by line drawn through). All marginal 
totals that are fixed automatically as a con- 
sequence of fixing other marginal tables are 
enclosed in parentheses, (For example, if the 
ACD marginal totals are fixed, the AC, 4 D, 
CD, A, C, and D marginal frequencies are 
fixed automatically as а result.) The remaining 
marginal frequencies (circle) constitute the 
set that must be adjusted for in determining 
the expected frequencies. For this example, the 
list would look as follows: (A), (B), (C), (D), 


4B; (AC), (AD), (вс) (вр) (CD), ABE 
ABD, (cD) Hen, BED 


For another example, to test the hierarchical 
hypothesis 4 X B = 0 from the data in Table 
3, only Steps 2 and 3 as enumerated above 
would be carried out, since the AB marginal 
frequencies would not be fixed in testing this 
hypothesis. Since 4 X B = 0 is an elementary 
hypothesis, only one cycle would be required; 
the entries obtained at the end of the cycle 
would be the same as those which would be ob- 
tained using the explicit formula in Table 2. 

The general iterative method illustrated can 
be applied for testing any hierarchical hypothe- 
sis. Haberman (1970) and Goodman (1971b) 
give methods for simplifying the iterative 
procedure in some cases by taking advantage of 
particular characteristics of the hypotheses 
being tested. 

The ordinary chi-square test of the hypothe- 
sis A X B XC =0 in the illustrative con- 
tingency table, based on the estimated ex- 
pected frequencies arrived at in Table 4, vields 
a p value of about 04. As an approximation to 
the exact probability of .07, the chi-square 
value is an underestimate, but the agreement is 
fairly good, considering the extremely small fre- 
quencies. In the absence of any empirical 
investigations of the chi-square approximation 
in this contest, the general recommendations 
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of Cochran (1952, 1954) on sample size could 
be taken as a rough guide for determining when 
the chi-square test would be appropriate. 


Relationships among Hierarchical I ypotheses 
and between Full and Marginal Tables: Condi- 
tional Chi-Square Tests 


In a factorial three-factor analysis of vari- 
ance design, the sum of squares for testing the 
hierarchical hypothesis 4 X B = 0 (ie., the 
individual hypotheses A XB=0 and 
A X B XC = 0) with (a — 1) (b — 1) (c — 1) 
+ (a = 1) (6 — 1) degrees of freedom, minus 
the sum of squares for testing the hvpothesis 
AXBXC =0, with (a — 1) (b — 1) (c — 1) 
degrees of freedom, equals the sum of squares 
for testing the nonhierarchical hypothesis 
AX B = 0, with (a — 1) (b — 1) degrees of 
freedom. An equivalent relationship does not 
hold in contingency table analysis because the 
individual effects are not estimated orthogon- 
ally. However, Goodman (1969b) has shown 
that if likelihood ratio chi-square statistics are 
computed to test the hierarchical hypothesis 
A X B =0 and the hypothesis A X B X C 
= 0, the difference between them is distributed 
as chi-square with (a — 1) (b — 1) degrees of 
freedom if the conditional hypothesis 4 X В 
= 0 given thal AXBXC=0 is true. In 
other words, the difference between the two 
likelihood ratio chi-square statistics is an ap- 
propriate test statistic for testing the non- 
hierarchical hypothesis A X B = 0 assuming 
thal A X B X C — 0, but is not appropriate 
for testing that hypothesis if this assumption is 
not made. Goodman has extended this type of 
analysis to provide a complete partition of the 
ikelihood ratio chi-square statistic appropriate 
or testing the hierarchical hypotheses that all 
factors are independent (i.e., that all interac- 
tions are zero) into a set of likelihood ratio chi- 
square statistics, each of which is appropriate 
or testing a conditional hypothesis of this sort. 
Of special interest is the fact that these types 
of hypotheses provide a bridge between the 
analysis of full contingency tables and marginal 
ables. In an earlier section, for example, it was 
pointed out that the average partial interac- 
tion of two factors in a three-factor table is 
equal to their interaction in the marginal two- 
actor table if the triple interaction is zero and 
at least one of the other average partial two- 


factor interactions is zero. Thus, a test of the 
hypothesis that the two-factor marginal inter- 
action .1 X B = 0, carried out by a test on 
the marginal 1 X B table, is equivalent to a 
test of the hypothesis that the average partial 
A X B=0, given that A X B X C =0 and 
either 4 X Cor B X C equals zero, carried out 
on the full contingency table. For extensions of 
this reasoning to higher-order contingency 
tables, see Goodman (1970, 1071a), who makes 
use of these relationships to build complex 
models through stepwise procedures somewhat 
analogous to those used in regre i 

One could claim that hierarchical hypotheses 
or conditional hypotheses of this type are more 
meaningful than the unconditional nonhier- 
archical hypotheses. For instance, in a three- 
factor table, the interpretation of a significant 
A X B interaction is certainly more compli- 
cated if there is A X B X C interaction. There- 
fore, it could be claimed that one should first 
test A X B X C, and if the effect is not sig- 
nificantly different from zero, it should be as- 
sumed to be zero, and the test of the condi- 
tional hypothesis A X B = 0, assuming 
A X B XC = 0, should be carried out. If the 
A X B X C interaction is significant, interpre- 
tations should proceed at this level without 
performing tests of the simpler relationships. 

While this procedure may be appropriate 
under some circumstances, particularly with a 
small number of factors, it will often be useful 
to test. the nonhierarchical hypotheses, since 
the simpler relationships are often much greater 
in magnitude than the more complex higher- 
order interactions, even if the latter do reach 
statistical significance. Under these circum- 
stances, the simpler relationships can usually 
be interpreted without excessive qualification. 
Furthermore, the assumption that a higher- 
order interaction is zero when the appropriate 
lest statistic does not reach statistical signill- 
cance is often in error, and as a result, the test 
of the conditional hypothesis concerning үн 
simpler effect is biased to an unknown dure 
The analysis of individual nonhierarchica 
hypotheses, as in analysis of vi lance, is bii 
fore recommended in general as a first ayer 
analysis, except that hierarchical hypotlitses 
dealing with higher-order interaction effects 
may be tested simultaneously, as discusset 
previously, 
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Nonhierarchical Hypotheses: Tesis Based on 
Logarithms of Adjusted Cell Frequencies 


In the log-linear model, Equation 1, log руу» 
is expressed as the sum of a number of effects. 
If the total number of observations in the con- 
lingency table is taken as a fixed number, V, 
note that the model and the hypotheses ex- 
pressed in terms of the model can be identically 
expressed in terms of logarithms of expected 
frequencies Fun, where Ек = Хра. The 
formulation in terms of expected frequencies is 
utilized in the discussion to follow, as it is some- 
what more convenient in developing tests. 

The assumptions made in analysis of vari- 
ance models of independence, normality, and 
homogeneity: of variance of observed cell means 
as estimates of population cell means cannot 
reasonably be made with respect to estimates 
of logarithms of cell probabilities or logarithms 


of expected cell frequencies in the log-linear 
model. However, for large values of JV, if linear 
functions of logarithms of expected cell fre- 
quencies are estimated by the corresponding 
functions of logarithms of observed cell fre- 
quencies, the distributions of these estimates 
can be closely approximated by treating the 
logarithms of observed cell. frequencies as 
though they were values of independent, nor- 
mally distributed random у iables, with the 
variance of each equal to the reciprocal of the 


cell frequency." These asymptotic properties 
can be utilized to construct significance tests 
of each effect individually (i.e., nonhierarchic- 
cally), as in an analysis of variance design. 
The approach is described first in the con- 
text of a 2 X 2 X 2 table. The hypothesis of 
zero three-factor interaction is expressed in 
terms of logarithms of cell probabilities in 
Table 1; it can be expressed in terms of loga- 
rithms of expected frequences FE;j, for two 


* Asy mptotic variance statements apply to natural 
logarithms. Therefore, these test statistics, as given, 
Tequire the use of natural logarithms. | | 

1 Similar asymptotic properties hold for the fre- 
quencies themselves, and tests based directly on the 
"equencies can be developed (Goodman, 1964b). How- 
Ever, these tests are not symmetric with respect to the 
Numbering of categories, and therefore only the tests 
Sed on logarithms will be presented here. 


levels of each factor, as follows: 
log Еп + log Ess + log Ёз» + logEssi 
— log Eus — log Б — log Fon 
— log Es = 0. [4] 


Similarly, the expression for zero average par- 
tial A X B interaction is as follows: 


È log Ens + У log Z 
k k 


= log Er 
k 


— 5 log Fay, = 0. [5] 


k 


In other words, cach hypothesis is equivalent 
to a statement that a single contrast among the 
log Eijs is equal to zero. 

Rather than using the logarithms and recip- 
rocals of the sample frequencies themselves to 
estimate the terms and variance of the con- 
trast, a better estimate is obtained by adding 
2 to each observed frequency before comput- 
ing these functions (Goodman, 1970). This will 
also allow estimates when there are obtained 
frequencies of zero, for which logarithms and 
reciprocals are not defined, although if the fre- 
quencies are generally small, the methods of 
this section do not apply. Using Naj" to repre- 
sent Nix + 5, the adjusted cell frequencies, 
the sample estimate of the A X B X C inter- 
action !! js 


авс = log Nm* + log Nass* + log Nor" 


+ log Nsn* — log Nn* — log Л7»* 


— log Nan* — log Neg 


and the sample estimate of its variance is the 
sum of the estimated variance of each term, or 


Vase = 25 У Y (1 Nije"). 


Then Sasc/ VV анс, or the sample estimate di- 
vided by its estimated standard deviation, is 
approximately a unit normal deviate, and the 
significance of AXBXC interaction can 
be determined by referring the ratio to tables 
of the unit normal distribution. Equivalently, 


H As pointed out, it is necessary to assume only that 
the measure of an effect is any function of the appropri- 
ate ratio of products of probabilities. For ease of exposi- 
tion, these tests will be described as though the loga- 
rithmic function had been chosen. The tests are valid, 
however, regardless of the particular function used to 
measure the degree of the effect, so long as it is a func- 
tion of the appropriate ratio. 
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the ratio S'4sc*/V asc is distributed approxi- 
mately as chi-square with one degree of freedom 
if#AXBXC=0. Similarly, the average par- 
tial A X B interaction can be tested by 
computing 


P» log Nin* + У log A 
2 k 


Sa = - log Хаце — > log Мам) 
Ya sru DW 


Which is distributed approximately as chi- 
square with one degree of freedom if this effect 
15 Zero. 

The approach can be extended easily to test 
any effect that can be specified by a single 
equation. This would apply to all effects in 
a 2” table, and all effects estimated with one 
degree of freedom in any table. The equation 
specifying each such effect sets a linear func- 
tion of logarithms of expected cell frequencies 
equal to zero. Compute the sample estimate of 
this function by replacing expected cell fre- 
quencies by adjusted cell frequencies .V *, Com- 
pute the variance of the function assuming that 
the sample estimates for the cells are inde- 
pendently distributed with variances equal to 
the reciprocals of the adjusted frequencies. The 
Square of the sample estimate divided by the 
variance is distributed approximately as chi- 
square with one degree of freedom if the 
hypothesis is true. 

The method can be applied without modifica- 
tion to test a hypothesis specifying a fixed value 
for any linear function of the logarithms of the 
cell probabilities or expected frequencies, For 
example, in a three-factor table, any hypothesis 
of the form У У Y; бєк log Eir = lo can be 

y wg Ж] 


tested by forming the ratio (S12/V1), where 


З= > уу; {к log Nij,* — f 
i j k 


and 


Vu-2 LE PCin/N; 
dn 


The ratio is distributed as chi-square with one 
degree of freedom if the hypothesis is true, 
Applying the procedure to test the hypothe- 
sis. X B = 0 in the 2 X 2 X 3 table previ- 
ously considered, the resulting values are 
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as follows: Say = — 4.007, Van = 6.309, 
(San*/Vag) = 2.55, р = 11, The p value 
differs considerably from the value derived 
from the exact test, .23. With such small cell 
frequencies, this approximate method is clearly 
inappropriate. 

The method is very simple when the hy- 
pothesis to be tested is equivalent to the state- 
ment that a linear function of the logarithms of 
the expected frequencies has a specified value, 
but is considerably more complex when the 
hypothesis implies specific values for a set of 
such functions. The larger the set, in general, 
the more laborious the procedure becomes. This 
problem arises both in testing hierarchical 
hypotheses and in testing nonhierarchical hy- 
potheses concerning effects that are estimated 
with more than one degree of freedom. In gen- 
eral, under these conditions, the method based 
on logarithms of adjusted cell frequencies 
becomes analogous to a multivariate analysis 
of variance. There seems to be no point in con- 
sidering it for testing hierarchical hypotheses, 
since the method based on maximum likelihood 
estimation is simpler in that case. At any rate, 
it would be applied in the same Way as in test- 
ing a nonhicrarchical hypothesis. 

The hypothesis to be tested should be speci- 
cified in terms of a minimum set of functions 
of logarithms of expected frequencies (or cell 
probabilities) that are assumed equal to zero, 
as detailed for the three-factor problem in 
Table 1. Consider any s such functions, for 
example, for a three-factor table 


E i. 4 б log Ig — bho, 
Tis 

25 2. 25, bom log E — d, ees 
$ d & 

25 25 5 Ck log Ik — (so. 

Y 4 o B 


Lı 


L, 
The estimated values from the sample would 
be 5, So, ..., Ss, where 5, = La x Hos 

FOX €. x 
X log Vin* — fu, S2 = 3 Z Y. бид log Nist 
aka 


— (20, etc. The estimated variances would be, 
respectively, 


Vu = 2 25 = Gigi IN in”, 
E 7 Е 

>, Ж, 2. [m Дө, 

i j k 
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etc., and the estimated covariances would be 
К» = Va = pa 2, 2. [o Vik, 
5 j k 


Кы = Га = x EN [m Nut. 


£l aj 
Th 


cte. Let S be the column vector of estimated 
functions, that is, S = col (S1, айлы» ltd, Let 
V be the (symmetric) covariance matrix, that 


is, 


Then 


V2=S'V-1§ [5] 


would be distributed approximately as chi- 
square with s degrees of freedom if the hypothe- 
sis were true (Anderson, 1958). 

In general, the procedure requires computa- 
tion and inversion of a covariance matrix of 
order equal to the number of degrees of frec- 
dom. In many cases, it is possible to simplify 
the test so that inversion only of one or more 
Matrices of considerably lower order is required 
(for details sce Goodman, 19644). In particular, 
in testing the hypothesis that the highest-order 
interaction equals zero in a table where only 
one of the variables has more than two cate- 
Sories, it is unnecess ry to perform any matrix 
inversion. ‘The simplified procedure was used 
lo test the hypothesis .1 X 8 X C = 0 in the 
2X 2 X 3 table in which that hypothesis was 
tested using the chi-square test based on esti- 
mitted expected frequencies (Table 4). The 
obtained p value, .15, is not in good agreement 
With either the exact p of .07 or the p of .04 
based on the maximum likelihood chi-square 
test. The example was presented only for 
illustration; the cell frequencies in this instance 
аге too small for the appropriate use of this 
test, 


Logarithms of Adjusted Cell Frequencies and 
Malti ple-Comparisons Methods 


Two main multiple-comparisons methods are 
Presented here. They can be considered either 
45 alternatives to the test procedure using 
Equation 8, or, in the case of the first method 
at least, as supplementary to it. The first is an 
analog of the Scheffé procedure in analysis of 
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variance (Goodman, 1964a; Schefé, 1959), 
Consider, for example, the hypothesis 


AX BXC=0 ina three-factor table. This 
hypothesis can be specified by setting (a — 1) 
(b — 1) (c — 1) linear functions of the loga- 
rithms of the 77; equal to zero. Since, if the 
hypothesis is true, all these functions equal 
zero, it follows that all linear combinations of 
them equal zero. This includes measures of in- 
teraction in all 2 X 2 X 2 subtables of the full 
table, and averages of these. Let fanc indicate a 
single function of this class. Each such function 
can be expressed as a linear function of the КЁ, 
and thus 5; and V; can be computed, as dis- 
cussed previously. The probability is approxi- 
mately (1 — a) that, for all Ss / Vy «us 
(a — 1) (b — 1) (c — 1)*, where Ха} (a — 1) 
(b — 1) (c — 1)? refers to the chi-square value 
at the alpha significance level, for (a — 1) 
(6 — 1) (c — 1) degrees of freedom. In other 
words, if any function f can be found for which 


qn? Xei (а= 0 (6 — 1) (c 1), [9] 


the hypothesis 4X B XC = 0 can be re 
jected at the alpha significance level. The 
advantages and disadvantages of the Schefié 
procedure have been widely discussed in the 
context of analysis of variance. An additional 
advantage in the contingency. table context, 
when considering it as an alternative to the 
overall chi-square, is its extreme simplicity as 
compared to the overall test. One can look at 
the results and pick a function which appears 
most likely to be significant, and then test it 
using this procedure. Alternatively, only a 
certain set of functions may be of interest, in 
Which case each of them may be tested in this 
manner, keeping the overall significance level 
less than or equal to alpha. 

In the case where only a finite set of func- 
tions is of interest, another possible simultane- 
ous test procedure, which should be considered 
as an alternative to the overall chi-square test, 
is based on Bonferroni statistics (Goodman, 
19042; Miller, 1966). 1f, for example, a set of 
F functions is of interest and if each is tested 
individually at significance level a, F, the over- 
all level is < Sa F =a. (Actually, each 


E 
could be tested at a different level, as long as 
the sum over all functions were equal to alpha.) 
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Applied to the previous example, if f were one 
of F functions to be considered, it would be 
considered significantly different from zero if 


т; 7 Хат, [10] 
/ 

where Хале; 1? is the chi-square value at the 
a/F significance level for one degree of frec- 
dom. If any of the F functions were found 
Significant by this criterion, the hypothesis 
AXBXC=0 would be rejected, at the 
alpha significance level. 

If a specific set of functions is of interest, 
then 5,*/V , could be computed for each, and 
compared with either Ха; tabte- 
Xa/r; 1*. The relative power of the two pro- 
cedures, given a specific contingency table, 
would depend on the number of functions F. 
It would be legitimate in any specific situation 
to compute both chi-square criterion values 
and to choose the smaller one to obtain the 
more powerful test. 

A set of functions which might form a rea- 
sonable finite set for general purpose hypothe- 
sis testing would be the set of interaction effects 
in all 2 2 X 2 subtables. In that « F 
would be equal to 


= OF 


= a(a — 1)b(b — 1)c(c — 1)/(2 X 2X 2). 


This would be somewhat analogous to con- 
sidering differences between all pairs of means 
in a simple analysis of variance design. For the 
usual significance levels, .05 and 01, the 
method based on Bonferroni statistics generally 
gives a more powerful test than the Schefié- 
type alternative with respect to this set of 
functions (a useful table for comparing the 
two can be found in Goodman, 1969a, p. 11). 
This implies that the method based on Bon- 
ferroni statistics is more powerful than the 
overall chi-square test with respect to these 
functions, Considering the much greater sim- 
plicity of the Bonferroni tests, and the fact 
that the information gained may be more 
readily interpreted, this method can be con- 
sidered a serious competitor to the overall chi- 
Square test. Its advantages with respect to the 
latter are analogous to the advantages of the 
Tukey studentized range test with respect to 
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the F test, with the additional advantage of 
much greater ease of computation. 

Although the multiple-comparisons methods 
have been described in the contest of the test 
of 1 X B X C interaction in a three-factor 
table, the principles can be readily extended to 
test any hypothesis in a general table. 


CONCLUSIONS 


The most pressing requirement at this stage 
is research to indicate what conditions on 
sample size are necessary in order to get rea- 
sonably close approximations using the chi- 
square techniques, These conditions are likely 
to be considerably more complex than for two- 
factor contingency tables. Probability levels 
obtained by chi-square procedures should be 
compared with exact probability levels, based 
on conditional tests, and with unconditional 
probability levels derived from sampling from 
populations under various conditions. Modifi- 
cations to obtain more satisfactory approxi- 
mations with moderate sample size should be 
systematically explored; many variations have 
been suggested in the literature, 

Although continual refinement of methods is 
to be expected, the methods that have been 
presented are sufficiently. practical and broad 
in scope to constitute а highly useful set of 
techniques for the detailed analysis of multiple- 
contingency tables. 
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CORRELATION BETWEEN GENETICALLY. MATCHED 
GROUPS VERSUS RELIABILITY THEORY: 


A REPLY TO JENSEN 


JOHN K. MILLER 
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In a “Note on Why Genetic Correlations Are Not Squared" Jensen draws an 
analogy to classical theory of test reliability to explain why “geneticists do not 


square the correlations between twins . . . 


in order to obtain the percentage of 


variance explained by genetic factors [p. 223]." It is concluded, in light of theo- 
retical assumptions inherent in the statistical model, that the analogy is improper 
and that its acceptance leads to overestimation of variance attributable to genetic 


factors. 


Recently Jensen (1971) stated the case for 
the practice of interpreting correlations be- 
tween phenotypes (observed measures of 
heritable traits taken on genetically related 
subjects) as direct proportional measures of 
phenotype variance attributable to trait 
heritability. The issue here is methodological : 
Is the correlation between phenotypes properly 
interpreted as the proportion of observed 
variance attributable to a hypothetical mea- 
sure called the genotype? Genotypes are 
defined as the heritable components of traits 
whose development is subject to both genetic 
and environmental determinants. 

Jensen (1971) contends that 
where rp, is the correlation between pheno- 
types (5) and where r,, is the correlation 
between phenotype and genotype (g), imply- 
ing that rp» represents the ratio of genotype 
variance to phenotype variance (5/52). For 
support he appeals to an analogy with measure- 
ment theory. Classical test theory defines an 
individual observed score (X) by the equation 
X =! +e, the sum of a “true” score (À and 
an "error" component (е). A priori, / for each 
person is stable over successive measures of 
the same trait, while e is random and varies 
unsystematically from one occasion to the 
next. Given these assumptions, the proofs are 
simple and straightforward that both 7,, and 
Ti equal the proportion of observed variance 
due to "true" variance (S; 2). Because 
“errors” are assumed to be random, and there- 
fore uncorrelated with “true” scores (ri, = 0) 
or with one another (re. = 0), their contri- 
bution to test reliability is zero. Jensen treats 
genetic variance and. environmental variance 


Top = Pog, 


as analogous to true variance and error 
variance, respectively. 

The relation of a phenotypic measure like 
IQ to its hypothetical components, heritability 
(g) and environmental factors (Æ), is governed 
by the legitimacy of assumptions associated 
with partitioning p according to the definitional 
equation p = g + X. Environment’s contri- 
bution to the phenotype must be taken to be a 
normally distributed random variable such 
that (а) rj, = 0; (b) reg = 0; (c) and the 
average contribution of Æ to p = 0. The first 
two assumptions must be satisfied if genotype 
and environmental variance are not to become 
confounded in л. The third assumption must 
be satisfied if the confidence interval embracing 
with known probability an individual g is to 
be estimated. Failure of theoretical formula- 
tions to reflect these statistical assumptions 
constitutes an irreconcilable conflict that can 
be resolved only by altering the theory or by 
finding a more adequate means of statistically 
relating empirical observations to theoretica 
constructs. Let us review briefly the conformity 
of theoretical constructs proposed by Jensen 
(1968) to the statistical assumptions implicit m 
the theorem rp, = 72 = SSR 

1. rye = 0: In the absence of experimental 
control of environmental factors, psycho- 
logical measurement of genetic effects on 
human behavior is limited to the framewort 
of causal comparative studies. In such pr 
random assignment of members from (16 
same kinship pair to different groups prior A 
statistical analysis is possible and desirab м 
This procedure, however, contributes to p 
trolling only the effects of sampling bias attrib 
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utable to differences within pairs which could 
contaminate correlation between matched 
groups. It would most assuredly have no 
neutralizing effect upon covariance attribut- 
able to similar and systematic environmental 
influences upon phenotype performance by 
both members of the same kinship pair. As a 
consequence, the door remains open to a 
systematic contribution of environmental fac- 
lors to phenotype differences between pairs 
irrespective of the groups to which the members 
of each pair are assigned. Jensen (1968) in 
fact, has suggested elsewhere that there is 
“... probably a positive correlation between 
the quality of environment and potential for 
Intelligence Гр. 7]" within any population 
Sampled. It is precisely such environmental 
Similarities to which Jensen (1968, p. 7) 
appeals to explain agreement in findings among 
Studies based upon different populations. 
Postulating the existence of such a relationship 
contradicts the statistical assumption required 
lo permit the interpretation of rp» as a ratio 
of genetic variance to observed variance 
(S/S. Relaxing the assumption that 
Тов = 0, requires 5,2 = S,? + Sg? + 25,5, 
So that rpp exceeds SS by at least 
255 S p/S 2, The net effect will be the con- 
,Ounding of purely genetic components and the 
interaction between genetic and environmental 
factors in Tom 

2. rey = 0: The failure of the assumption 
of independence between genotype and en- 
Vironment implicitly weakens any assumption 
of independence between environments (even 
Separate environments) experienced by the 
two members of a kinship pair. Furthermore, 
random separation of kinship pairs into 
different groups could no more eflicaciously 
negate environmentally determined phenotype 
Similarities within pairs than it would control 
Correlation between genetic and environmental 
factors, Even if, contrary to Jensen's (1968, 
D. 7) suggestion, rg were zero, the likelihood 
femains that notable environmental similari- 
ies would exist even for kinship pairs reared 
“part. Prenatal identical 
twins are not random. Nor are the neonatal 
“iranments of twins random, unless sepa- 
Ration occurs at birth. Nor, does it appear 
Майы to assume that twins separated at 
arth have equal and unrelated prospects for 


environments of 
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rearing in an enriched or an impoverished 
environment. The relation between factors 
like socioeconomic status and measured intelli- 
gence even among genetically unrelated mem- 
bers of the same social class is accepted as real 
and substantial (Jensen, 1968, pp. 15-18). 
Environmental factors common to genetically 
related individuals, therefore, could contribute, 
independent of genotype, to reliable differences 
between kinship pairs. The result would be a 
contribution to the correlation between pheno- 
types (r5) quite independent of the pheno- 
type-genotype correlation (r,,). Maxwell 
(1968) has, for example, shown that in the 
estimation of reliability, relaxing the assump- 
tion of uncorrelated errors (the assumption of 
uncorrelated environments in the genetic 
analogy) will result in a biased estimate of the 
reliability coefficient (rpp in the genetic 
analogy). Such bias is usually in the direction 
of overestimation. 

3. The average contribution of Æ to p is 0 
(i.e., Mean of E = 0): The final assumption 
required to satisfy fully an analogy of a 
reliability coefficient with correlation between 
phenotypes does not directly affect. the in- 
terpretation of rj, as a function of pp. Yet 
failure to fulfill the assumption has important 
consequences for the practical utility of rp, 
in relation to trait heritability. Environment 
is viewed as a "threshold variable" whose 
effect on phenotype performance is limited to 
the extent to which it hinders or facilitates 
the attainment of full genetic potential 
(Jensen, 1968. pp. 10-14). This precludes 
estimation of genetic analogues to the standard 
error of measurement and the interval estima- 
tion of true score performance of single 
individuals. As a threshold variable, environ- 
ment must be conceived of as essentially 
negative, having a negative mean and a 
maximum value of zero. The mean of observed 
performance, therefore, cannot be equated with 
the mean of genotype performance as observed 
and true means may be equated in reliability 
theory. And, as a result, a “standard error of 
genetic potential” cannot be estimated as a 
direct function of phenotype variability. 

With due deference to the scholarship em- 
bodied in Jensen’s attack on the problem of 
psychological heritability, his faith in the 
potency of correlation between genetically 
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SELF-CONTROL OF CARDIAC FUNCTIONING: 
A PROMISE AS YET UNFULFILLED ! 


EDWARD B. BLANCHARD? лхо LARRY D. YOUNG 


University of Mississippi Medical Center 


The research in the operant conditioning or self-control of four cardiac func- 
tions, heart rate level, heart rate variability, blood pressure, and cardiac 
arrhythmias, is reviewed. Although the fact of the self-control of these 
functions seems well established, with few exceptions the magnitude of change 


is stati 


i ically significant rather than clinically significant. A few studies, all 
involving numerous training sessions, have reported large magnitude changes 
Several methodological issues are raised and discussed. Finally, alternativ 


modes of achieving self-control of cardiac function, such as progressive relaxa- 


tion and Yoga exercises are discussed. 


tiene mid-1960s an increasing number 
tioning 4 the topic of the operant condi- 
of =i. /or establishment of self-control 
Sema ee processes in humans have 
Out the Ше literature. These articles hold 
of Бе rh either implicitly or explicitly, 
Orders bv ie to treat cardiovascular dis- 
Cologicay паана, rather than pharma- 
Nation of dm However, a critical exami- 
Purpose of » Iterature, which is the principal 
for the m the present article, reveals that. 
fulfilled € part, this promise is, as yet, un- 
reported ا‎ rare exception, the results are 
lon! Or changes in cardiovascular func- 
and + Sag normal subjects instead of patients 
een of magnitude of change reported has 
Clinically statistically significant, rather than 
* Significant, magnitude. 
Our m is summarized in terms of the 
Eon ыс Ovascular responses studied, with 
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particular. emphasis on the magnitude of 
change obtained. From this summarization, 
several methodological issues become appar- 
ent and are subsequently discussed. 


Heart КАТЕ 


In Table 1 are summarized the studies in- 
volving change in heart rate as the dependent 
variable. Omitted from this table, and from 
intensive consideration by this review, are 
studies by Shearn (1962), Frazier (1966), 
and Brener (1966), all of which used a shock 
avoidance procedure. Basically these studies 
allowed the subject to avoid mild but painful 
electrical shocks by emitting a heart rate of 
the appropriate magnitude. These studies, 
which did show that subjects could learn some 
degree of heart rate control, are omitted be- 
cause such a paradigm was not thought to be 
relevant for clinical practice. 

Inspection of Table 1 reveals that each 
experimenter has tended to use his own indi- 
vidual method of determining if subjects can 
change their heart rate. Although, with three 
exceptions, the magnitude of the change in 
heart rate achieved is quite small, ranging 
1-7 beats per minute, nevertheless, it seems 
fairly well established that small, but sta- 
tistically significant, changes in heart rate can 
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be obtained through giving subjects feedback 
or reinforcement. However, these changes, 
though of sufficient magnitude to be statisti- 
cally significant, are not clinically significant. 
For the purposes of this review, a change in 
response is considered to be clinically signili- 
cant if it is equal to either 20% of the base 
rate or, in the case of an abnormally high 
response, the change is such as to return the 
response to the normal range. 

The three studies which have demonstrated 
large magnitude changes in heart rate include 
a single subject whom Headrick, Feather, and 
Wells (1971) ran for 14 sessions and who 
demonstrated within-trial increases in heart 
rate of 30 beats per minute for the short 1- 
minute trial. Recently Stephens, Harris, and 
Brady (1972) have reported on four normal 
subjects who were run under a complex com- 
bination of feedback and reinforcement con- 
ditions for 10-18 sessions of trials varying 
5-60 minutes, In all four subjects clinically 
significant within-trial increases in heart rate 
were obtained, ranging 15-29 beats per min- 
ute; moreover, two of the subjects were able 
to decrease their heart rate 5-8 beats per 
minute from base line. A final study by Scott 
et al. (1972)? reported clinically significant 
increases in heart rate of 16 and 25 beats ре! 
minute in two normal subjects which were 
maintained over three consecutive daily ses- 
sions. More importantly, a stable (over daily 
sessions) 16 beats per minute decrease 1 
heart rate was obtained in a patient suffering 
from tachycardia, Coincident with the de- 
crease in heart rate (in the latter subject 
were several clinical improvements including 
obtaining gainful employment. This last case 
is the only reported direct clinical application 
of the studies of heart rate conditioning 0 
date. xi 

The methodological issues and conflict? 
which have arisen in this area are presente’ 
next. 


Operant Conditioning versus Selj-Control Я 
; р Р „ wet 1 

The first issue is one of terminology, oi 
extends beyond mere semantics. Enge 


2 RW. Scott, E. B. Blanchard, E. Edmundson, © 
L. D. Young. An improved shaping procedure Hi 
clinical cardiac control. Paper presented at S 
eastern Psychological Association meeting, 
1972. 
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his associates refer to their operations as 
Operant conditioning. In addition to visual 
feedback, their subjects were given small sums 
of money for accumulated experimental time 
during which their heart rate was in the 
appropriate direction relative to a criterion 
heart rate. Stephens et al. (1972) followed a 
similar procedure. 

Most of the other studies of heart rate 
change are described in terms of self-control 
established through external sensory feedback. 
It is possible, of course, to construe all of 
these studies as examples of operant condi- 
tioning in which the reinforcement is knowl- 
edge of having successfully emitted the cor- 
Fect response. On the other hand, it is equally 
Possible to construe the reinforcement, espe- 
cially that used by Bergman and Johnson 
(1972), in informational terms as binary 
feedback, since subjects receive reinforce- 
Ment if they are correct and do not receive it 
if they are not making the correct response. 

If one considers knowledge of results to be 
2 reinforcer, feedback and reinforcement are 
inextricably confounded in the feedback stud- 
tes. Therefore, it is not possible to settle the 
question by merely giving one group of sub- 
v feedback and a second group reinforce- 

» and then comparing their performance. 
era it would be possible to determine if 
бар forcement used by Engel has an effect 
ing th that achieved by feedback by compar- 
in Eng, performance of groups of subjects run 
tary vox 5 paradigm with and without mone- 

einforcement, 


T4 
‘Pe of Feedback 


А There ca- 


tion of are two dimensions in the sg зи : 
Sensory types of feedback used. The nne ^ 
su hiec modality, Feedback has been given 

or by ra һу either auditory or visual means, 
ог Nur with no apparent rationale B en 
Chard erring one mode to the other. m 
Пе г and Young (1972), in their study О 
l reltitive efficacy of these two modes, made 
Sons Within- and between-subject ош 
пе Es found no statistical айша ге : 
Wou sory modality over the other. - us е 

S" Seem that the decision as to which gr 
a у, dality one should use, particularly in 
ier 180 procedure, must rest on factors 
an efficacy. 
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In describing the second classification di- 
mension, we have introduced some new termi- 
nology which may require explanation. By 
binary feedback we refer to a situation such 
as that in studies by Engel and his associates 
and by Brener and his associates wherein the 
subject's heart rate in terms of interbeat in- 
tervals, the reciprocal of heart rate, is com- 
pared to some criterion on a beat-by-beat 
basis. After each comparison, the subject is 
automatically informed as to whether or not 
he has met or exceeded the criterion by the 
presence or absence of the feedback signal, 
respectively. This class of feedback has been 
delivered in either the auditory mode (Brener, 
Kleinman, & Goesling, 1969) or the visual 
mode (Engel & Chism, 1967b; Engel & Han- 
sen, 1966). 

By proportional feedback, we refer to a 
situation such as that in studies by Finley 
(1971), Sroufe (1971), Blanchard et al. 
(1972) in which the subject is provided with 
direct knowledge of heart rate on a beat-by- 
beat basis. With this kind of feedback, the 
subject is told not only if he is right or 
wrong as in binary feedback, but also by how 
much. At the time of this review no one has 
compared binary and proportional feedback 
with regard to relative efficacy. 


Knowledge of Response by the Subject 


Whether or not the subjects should be in- 
formed of the particular response which they 
are to change is an issue in the field of heart 
rate control which provokes almost as much 
mixed opinion as the particular feedback ar- 
rangement to use. From Table 1 one can see 
that researchers are fairly well divided as to 
whether or not to inform the subjects of the 
nature of the response. For example, Brener 
changed his policy on informing the subjects: 
in his first two studies (Brener & Hothersall, 
1966, 1967) the subjects were not informed; 
in his later study (Brener et al., 1969) they 
were informed. 

This difference of opinion seems to date 
back to the early work of Engel and his asso- 
ciates. For reasons which are unclear in their 
reports, they elected not to inform subjects 
what response was related to the feedback 
and reinforcement. Engel and Hansen (1966) 
found, post hoc, that subjects who did not 
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correctly infer the response that was related 
to the feedback and reinforcement contingency 
showed evidence of learning the heart-rate- 
slowing response whereas four of their five 
"nonlearners" correctly guessed that heart 
rate was the response, In a second study 
(Engel & Chism, 1967b) although all five 
subjects showed evidence of heart rate speed- 
ing, four of the five subjects did not know the 
nature of the response, 

Bergman and Johnson (1972) have taken a 
Step toward resolving this issue by treating 
awareness of the response as an independent 
variable. Of two groups of subjects receiving 
no feedback of heart rate, those informed of 
the response raised it significantly higher 
(AX — L8 beats per minute) than unin- 
formed subjects trying to control an unspeci- 
fied response, Interestingly, a third group of 
subjects who were both informed and given 
feedback in the form of heart sounds did no 
better than the informed, no-feedback sub- 
jects, 

One would hope that further research in 
this area will show that knowledge of the 
response being changed, at the very least, has 
no detrimental effect since in a clinical appli- 
cation of these procedures one would prefer to 
be able to inform patients fully about the pro- 
cedure. 


Types of Control Groups 


Several different policies on control groups 
have been followed in these studies. One pol- 
icy has been to use no formal control group, 
but instead to use subjects as their own con- 
trols by comparing heart rate during experi- 
mental trials to that recorded during some 
other period, This within-subject control pol- 
icy was followed by Blanchard, Young, and 
McLeod, 1972; Brener and Hothersall, 1966, 
1967; Brener et al, 1969; Headrick et al., 
1971; Levene, Engel, and Pearson, 1968; and 
Stephens et al., 1972. 

А second policy on control groups, which 
has been followed by Engel and Hansen 
(1966) and Engel and Chism (1967b), is the 
use of yoked controls. In this procedure the 
control subject receives the same feedback 
signal as the experimental subject to whom he 
is yoked. A similar form of false feedback 
has been used by Finley (1971). 
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Church (1964) has pointed out the sta- 
tistical and methodological fallacies inherent 
in the use of yoked controls. Examination of 
the results for subjects given some form of 
false feedback reveals that the performance of 
these yoked control subjects tends to deteri- 
orate, that is, their responses are in the wrong 
direction. For example, in the study of heart- 
rate-slowing by Engel and Hansen (1966) 
the control subjects achieved a mean increase 
in heart rate of 5.6 beats per minute. Further- 
more, Finley's (1971) control subjects showed 
an increased heart rate (AY =.8 beats per 
minute) when they received false feedback as 
contrasted to times when they received no 
feedback. Comparisons of changes in heart 
rate for experimental and control subjects 
thus yield spuriously large effects because of 
capitalizing on differences which include the 
deterioration of the controls. 

The appropriate control condition would be 
one in which subjects receive the same in- 
structions as the experimental subjects but no 
feedback. The inclusion of an instructed, no- 
feedback control has been followed by Blan- 
chard and Young (1972), Bergman and John- 
son (1972), Brener et al. (1969), Johns 
(1970), and Sroufe (1971). 


Changes in Heart Rate in the Absence of 
Feedback or Reinforcement 

The use of a no-feedback condition enables 
one to detect if subjects can significantly 
change their heart rate in the absence of T. 
ternal feedback or reinforcement, The resu a 
on this point are mixed. Bergman and John- 
son (1971) found both significant increases 
and decreases for very brief (4—6 seconds) 
trials in the absence of any feedback. [n a 
later study, described above, Bergman aní 
Johnson (1972) found that subjects instructet 
to increase heart rate but given no feedbac? 
did as well as similarly instructed subject 
who received a form of auditory feedback. 

Although the subjects of Brener E А 
(1969) who received feedback on 100% art 
the trials showed significantly greater vue 
rate control than those subjects who receiv 
no feedback, the latter group did Show, e 
small (AX — 2.4 beats per minute) but ws 
nificant difference between raise and ues 
trials. However, since the results were 
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ported as differences, no directional state- 
ment can be made írom their data. Both 
Johns (1970) and Blanchard and Young 
(1972) found that their no-feedback control 
subjects could lower heart rate but could not 
raise it (X = —2.3 beats per minute, X = 
-22 beats per minute, respectively). At this 
Point one cannot conclude definitely that 
feedback enables subjects to change heart 
rate in either direction more than subjects 
Who are similarly instructed but receive no 
feedback, ` 
Another way of viewing this question is to 
examine how well subjects who have been 
trained to control heart rate with feedback 
Can control their heart rate in the absence of 
feedback. This is a point of considerable clini- 
Cal significance since transfer of the self- 
Control that patients learn in an experimental 
Setting to their natural environment is essen- 
tial. Brener et д], (1969) found significantly 
Sreater generalized self-control for subjects 
Sven feedback on 100% of trials than to 
Subjects given no feedback. 
4 Le A d Pone =ч 
has achieved si ага ; np aa jects, who 
Were 5 ved similar results, His subjects, Wio 
Вади to control heart rate variability 
Proporti paced n piration and who were given 
Sant p ‘onal visual feedback, showed signifi- 
ansfer of self-control to a no-feedback 


State ۾‎ Д Pues 
“te after only a brief (5-minute) training 
Deriod, Ў 


Respiratory Effects 


E" ү in the field of self-control of car- 
Bim. Parum surrounded by much pai 
tional be the relative importance of media- 
ariables, Although mediation by means 
Prices or unspecified skeletal muscular 
Ocus ET has been suggested, the primary 
On the Sus mediation controversy Баз, be 
Menta] home of respiratory variables. Peer 
ocedures designed to answer whether 


fron the obtained heart rate change resulted 
era] “Spiratory changes fall into four ge 
алей ОПЕ Although no a 
Else i monitor respiration, some did u : 
Brene, OT and  Hothersall ( 1900) Bee 
"ation i» al. (1969) report monitoring Bm 

Jub do not mention any instructions 
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concerning breathing nor do they report any 
analyses of the observed differences. 

Blanchard and his associates ( Blanchard et 
al, 1972; Blanchard & Young, 1972) and 
Engel and his associates (Engel & Chism, 
1967b; Engel & Hansen, 1966; Levene et al., 
1968) gave all subjects in their investigations 
the explicit instructions not to change their 
breathing pattern from normal. Again, how- 
ever, only monitoring and visual inspection of 
the data were used to reach the conclusions 
that respiration did not change significantly 
and that the observed heart rate differences 
did not result from respiratory changes. Le- 
vene et al. (1968) did carry the investigation 
a step further, however, by teaching two sub- 
jects whose respiration rates were variable to 
duplicate their earlier respiration rate through 
a pacing technique. This resulted in changes 
in heart rate variability but not in overall 
heart rate. 

The third approach to the problem, namely, 
manipulation of respiration rate to insure its 
constancy during heart rate conditioning ses- 
sions, was employed by Brener and Hother- 
sall (1967), Johns (1970), and Sroufe (1969) 
and basically consists of having subjects pace 
their respiration to a metronome or rhythmic 
puff of air set at their adaptation respiration 
rate. Even with respiration rate held con- 
stant experimentally, subjects were still able 
to alter their heart rate significantly in the 
appropriate direction. However, as Sroufe 
pointed out, ability to control heart rate ap- 
pears to deteriorate with any session when 
the subject is attempting to control two sys- 
tems simultaneously, 

The final method used (Finley, 1971; 
Headrick et al, 1971; Hnatiow & Lang, 
1965: Lang, Sroufe, & Hastings, 1967) to 
insure that changes in heart rate were not the 
result of respiratory differences is to monitor, 
analyze, and control statistically through co- 
variance analyses any systematic variations 
in respiration that are found, There have been 
no significant relationships between heart rate 
change and respiratory changes with two 
Headrick et al. (1971) 
correlation between 


minor exceptior 
found a significant | 
breathing amplitude and heart rate for only 
one subject: and Hnatiow and Lang's (1965) 
single significant correlation was between 
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average heart rate deviation during the final 
no-feedback period and the average inspira- 
tion to total cycle ratio for the final feedback 
period. 

A summary of cardiorespiratory relations 
is not complete without the inclusion of two 
studies which focus specifically on them, In 
the earlier investigation (Engel & Chism, 
1967a) 20% changes in respiration rate sus- 
tained for a 10-minute period did not cause 
changes in average heart rate. However, an 
inverse ri ation was noted between respira- 
rate and heart rate variability. Sroufe's 
1) series of three experiments was 
broader in scope since he manipulated respira- 
tory volume in addition to respiration rate. 
His results confirmed the earlier finding of 
Engel and Chism (1967a) that changes in 
respiration rate affect only heart rate varia- 
bility with increases in respiration rate pro- 
ducing a more stable heart rate. However, 
changes in respiratory volume affected both 
the level and variability of heart rate with 
deeper breathing producing a higher but more 
variable heart rate. 

Tn light of the evidence, monitoring of res- 
piratory variables seems mandatory for those 
investigations attempting to show heart rate 
control. However, monitoring alone is not 
sufficient; the effects of respiratory changes 
must be controlled either experimentally or 
statistically to eliminate this possible source 
of confounding. As Sroufe (1971) has so 
aptly observed, “The fact that cardiac con- 
trol is possible with respiration experimentally 
controlled merely suggests that subjects сап 
control rate independent of respiration, not 
that respiration is of minor importance [p. 
654].” 

Sroufe’s (1971) research carries several im- 
plications and Suggestions for subsequent 
research in the area of heart rate self-control, 
particularly clinical studies: Since respiration 
and skeletal muscle activity are more dis- 
criminable than cardiac activity, they might 
profitably be used early in training and grad- 
ually faded as the desired response is ac- 
quired. As Sroufe (1971) Showed, utilization 
of these or similar techniques improved the 
transfer of the learned cardiac control from 
periods where feedback is present to periods 
where it is absent: ultimately, utilization im- 
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proved the first step in establishing generali- 
zation from the experimental chamber to the 
natural environment. 


Heart RATE VARIABILITY 


Lang and his associates have been the prin- 
cipal researchers investigating the self-con- 
trol of heart rate variability. All of their 
studies have used the same basic experimental 
procedure: The experimental subjects are 
given proportional visual feedback of their 
heart rate and asked to keep it within #3 
beats per minute of a set point (their mean 
preexperimental heart rate). Dependent mea- 
sures are the standard deviation of the dis- 
tribution of interbeat intervals and time on 
target (within the = beats per minute range). 

In the first study (Hnatiow & Lang, 1965) 
experimental subjects performed significantly 
better than yoked controls who received false 
feedback and who were told only to track us 
visual display, with no mention of controlling 
it. 

The second study (Lang, Sroufe, & Ше. 
ings, 1967) was a refinement of the де 
include additional controls. Subjects in the 
experimental group again received propor- 
tional visual feedback of heart rate, were tal 
what the feedback display represented, pr 
were instructed to try to keep their heart d 
within a 6 beat per minute range. Those a 
one control group received the same instru 
tions but the feedback display of their aen 
experimental partner. The other two m 
groups were not informed concerning p. 
the visual display represented, and were 
only to track the visual display. 

The experimental group was s , 
better at reducing heart rate variability 
any of the controls. Of special note аге 
within-subject analyses which showed th Шу 
experimental subjects improved x pagan 
from no-feedback to feedback-available ne 
ditions. The control subjects yoked one 
experimental subjects, who were also АР tin 
to control heart rate, did poorest, зир? pro- 
our previous criticism of this contro 
cedure. ; 

In the third study of this series 
1969), subjects again were given eit! 
or false proportional visual feedback s ec 
rale and told to keep it within a SP 
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range. The purpose of the study was to con- 
trol for respiration effects and the possibility 
that subjects were reducing heart rate varia- 
bility through respiratory means. Thus three 
groups were taught, over several sessions, to 
breathe at a fixed rate, while a fourth group 
had no restrictions placed on their breathing. 
АП subjects received two training sessions be- 
fore data were collected in a third session of 
l-minute trials. Again, subjects given true 
feedback showed significant reductions in 
heart rate variability during times it was 
available compared both to no-feedback pe- 
Tiods and to subjects receiving false feedback. 
For subjects breathing at fixed rates, per- 
7^ — formance tended to deteriorate over time while 
no such deterioration was found for subjects 
Not under respiratory restriction. 

Interestingly, a final study in this series 
(Hnatiow, 1971) failed to replicate the early 
findings of better control by subjects receiv- 
Ing feedback than the yoked controls in spite 
of using identical equipment and procedures. 

To date, Lang and his associates have re- 
Ported no clinical application of their pro- 
cedure. In fact, all of their work has been 
With normal college students. 


BLOOD PRESSURE 

T Table 2 are summarized the studies in- 

Ng change in blood pressure as the de- 
bendent variable. 

n all of their blood pressure control stud- 

^ Shapiro and his colleagues at Harvard 

Ave used the same basic experimental pro- 
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Korotkoff sounds within a specified time in- 
terval after each heart beat. Feedback deliv- 
ery is dependent upon the appropriate re- 
sponse. For example, in trying to shape a 
decrease in systolic blood pressure, feedback 
would be given the subject if no Korotkoff 
sound was detected following the R wave of 
the electrocardiogram. Had a sound been de- 
tected, indicating that the systolic pressure 
was above the cuff pressure, no feedback 
would be given. Typically subjects are given 
25 trials per session with a trial being 50 
interbeat intervals long. 

As can be seen in Table 2, with the excep- 
tion of their recently completed clinical trial 
( Benson, Shapiro, Tursky, & Schwartz, 1971), 
the magnitude of change in blood pressure has 
been rather small, averaging less than 1 milli- 
meter of mercrury per trial for subjects who 
were increasing blood pressure and about 2-3 
millimeters of mercury per trial for subjects 
lowering their blood pressure. Using as an in- 
dex of the efficacy of their procedure the 
average blood pressure on the final block of 
five trials, the average amount of increase in 
blood pressure has been 1-2 millimeters of 
mercury while the average decrease in sub- 
jects lowering blood pressure has been 4-6 
millimeters of mercury. 

Despite their impressive series of studies, 
manv of the criticisms previously leveled at 
the research on self-control of heart rate can 
be supplied to the blood pressure self-control 
studies. First and foremost, it is not possible 
to tell what are the active ingredients in the 
Shapiro procedure since subjects receive both 
binary auditory and visual feedback as well 
as reinforcement on some fixed ratio schedule. 
Whether reinforcement contributes to the effi- 
cacy of the procedure is not known, nor are 
the relative efficacies of the types of feedback. 

A second problem with these studies has 
been the failure to include a no-feedback con- 
trol in order to evaluate the efficacy of their 
combination of procedures over merely in- 
structing subjects to lower their blood pres- 
sure, In one study (Shapiro et al. 1970a) a 
group reinforced on a random basis was in- 
cluded, Results showed that groups reinforced 
for raising or lowering blood Pressures; while 
different from each other, did not differ from 


this group which was reinforced randomly. 
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Thus to date, this group of investigators has 
shown primarily that subjects reinforced for 
raising blood pressure are different from sub- 
jects reinforced for lowering blood pressure. 

Despite the methodological problems with 
these studies, the Shapiro group certainly de- 
serves praise for their pioneering work in this 
area. Recently they have focused on self-con- 
trol of diastolic blood pressure (Shapiro et al., 
1972) and the degree of integration and 
differentiation of the cardiovascular system 
(Schwartz, 1972; Schwartz, Shapiro, & 
Tursky, 1971). The former line of research, 
using the same basic experimental paradigm, 
has shown that diastolic blood pressure can 
be altered significantly by this procedure and, 
of much interest from a clinical point of view, 
that the changes show some degree of sta- 
bility after the feedback and reinforcement 
are no longer available. In the latter research 
area, a fair degree of independence of heart 
rate and blood pressure have been shown. 

The study by Benson et al. (1971) answers 
one of our major criticisms leveled at the 
blood pressure self-control research, namely, 
the exclusive reliance on normal subjects. Ben- 
son applied Shapiro's basic blood pressure 
control procedure to a group of seven hyper- 
tensive patients. Their average systolic blood 
pressure during the final base-line periods 
was 165 millimeters of mercury. Clinically 
significant decreases in blood pressure were 
obtained in five of the seven patients (X — 
22.6 millimeters of mercury). 

As with the heart rate studies which showed 
large scale changes in that response, numerous 
trials were necessary to obtain this magnitude 
of change (X — 21.7 conditioning sessions of 
25 trials each). While no data are available 
on the stability of the change when the feed- 
back was no longer available, or on the degree 
of transfer of the decreased blood pressure 
outside of the laboratory, these results are 
provocative enough to warrant following up 
and certainly to increase the clinical promise 
held out by this research, 

As can be seen in Table 2 there has been 
only one important study of blood pressure 
self-control conducted by an investigator out- 
side of the Shapiro group at Harvard, and 
that is by Brener and Kleinman (1970). This 
study, which utilized a form of proportional 
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visual feedback, has several commendable 
features: The size oí the change in blood 
pressure, approximately 15  millimeters of 
mercury in the first session and approxi- 
mately 12 millimeters of mercury in the sec- 
ond, is considerably larger than that reported 
by Shapiro and his colleagues with normal 
subjects, and is of a magnitude which could 
be clinically significant. Second, control sub- 
jects who received the same type of feedback 
of their own blood pressure but were not in- 
structed to change their blood pressure were 
used and showed no overall change in blood 
pressure. A further control group is needed, 
however, in which subjects, who are in- 
structed to try to lower blood pressure, re- 
ceive no feedback. | 

The form of feedback used within the с1аз51- 
fication system presented previously is closest 
to proportional visual feedback, Subjects were 
able to view a manometer which continuously 
displayed pressure in the occluding staff. The 
cuff pressure, and hence the feedback. ге- 
ceived by subjects, oscillated about the indi- 
vidual subject's systolic blood pressure. Ап 
interesting aspect of this procedure was the 
taking of blood pressure on the left index 
finger rather than upper arm. Such a pro- 
cedure minimizes the problem ischaemia. 

The size changes in blood pressure ob- 
tained by Brener and Kleinman’s procedure 
are much larger than those obtained by age 
piro and his colleagues in a similar length © 
time on a similar population of normotensive 
college-age males. One could speculate that the 
major reason for this difference is that Brene! 
and Kleinman's subjects received a form 0 
proportional feedback whereas Shapiro's p 
jects received binary feedback delivered both 
visually and aurally. 

CARDIAC ARRHYTHMIAS 

Although there have been several brief 1 
ports (Engel & Melmon, 1968; Weiss & Bopa 
1970) on the operant conditioning of care з 
arrhythmias, most of the data seem to be v 
marized in one article (Weiss & Engel. А 
The latter study contains detailed repor жа 
eight patients suffering from premature m 
tricular contractions who were all (геше rt 
approximately the same experimental pari 
digm. 
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The paradigm was, for the most part, a rep- 
lication of the procedures used in Engel's three 
previous studies. Patients were given binary 
visual feedback as to whether their heart rate 
was at or beyond the desired heart rate in 
the appropriate direction. In this study, con- 
trary to the previous work with normals, pa- 
tients were fully informed of all the details 
of the experiment, especially the response be- 
ing monitored and also were not given mone- 
tary rewards for time accumulated in the cor- 
rect direction. Despite the omission of this 
tangible reinforcer, the authors still describe 
their study in terms of “operant conditioning” 
rather than self-control through feedback. 

Most patients were run under four condi- 
tions and some under a fifth, These were, in 
the usual order, heart rate speeding, heart rate 
slowing, alternately speeding and slowing 
heart rate for 1—4 minute trials, and then con- 
trol of heart rate variability. In this final con- 
dition, subjects received binary feedback 
(light onset) when their heart rate stayed 
within a 10-beat range. The nature of this 
feedback arrangement also provided subjects 
with feedback of the occurrence of each pre- 
mature ventricular contraction. The fifth con- 
dition, in which four of the patients were run, 
consisted of a gradual removal of feedback. 
Feedback was alternately presented and re- 
moved; the initial ratio of 1 minute of feed- 
back to 1 minute without was extended to 1 
minute on and 7 minutes off. Interestingly, in 
this final phase subjects did as well without 
feedback as with it, indicating that true self- 
control had been learned. . 

Four of the eight patients had a signifi- 
cantly reduced frequency of premature ven- 
tricular contractions after the training pro- 
cedure as measured in the ward and at a 
follow-up 3-21 months afterward. А fifth 
patient did not have a significant reduction in 
premature ventricular contractions rate but 
did learn to recognize their occurrence and 
was able to control them at home through 
resting, The other three patients showed no 
evidence of learning to control premature 
ventricular contractions. 

Total number of training sessions ranged 
22-77 with a mean of 57.5. The five cases 
Which can be considered successes all had 47 


Or more training sessions. Our reanalysis of 


159 


these results show that improvement is sig- 
nificantly (by р = .05, Fisher’s exact proba- 
bility test) related to receiving 47 or more 
training sessions. 

Weiss and Engel deserve commendation for 
the following two related techiques for ex- 
tending their procedures into the area of 
clinical application: first, eliminating the 
necessity for external feedback to control the 
cardiac dysfunction, and more importantly, 
monitoring premature ventricular contraction 
rate outside the experimental chamber. The 
degree of generalization observed in this lat- 
ter procedure is especially encouraging. Gen- 
eralization of learned control of cardiac func- 
tioning to the natural environment, true self- 
control is the desired clinical terminal be- 
havior. 

There are two obvious criticisms which can 
be leveled at this otherwise excellent clinical 
study. The first is the lack of a controlled | 
group of patients who were either untreated 
or given some equally time-consuming and in- 
volving placebo task. This criticism is obvi- 
ated to some degree because of the prolonged 
base-line period prior to the experimental 
phase during which premature ventricular 
contractions were noted. The documented his- 
tory of premature ventricular contractions 
ranged 3 months-8 years with a mean of 3.0 
years, Despite this, one would hope that a 
next step would be a controlled outcome study 
of the procedure. 

The second criticism is the obvious con- 
founding of several aspects of the training 
procedure. One does not know whether the 
entire sequence of training procedures is nec- 
essary or only part of it, Thus a next step 
would be to isolate which parts of the training 
procedure are necessary, facilitative, or su- 
perfluous, in order to improve efficiency. 


ALTERNATIVE METHODS OF CARDIAC CONTROL 


There are at least two additional lines of 
research which are related to the area of self- 
control of cardiac function: (a) the use of 
progressive relaxation as developed by Jacob- 
son (1938) and (5) the use of various Yogic 
exercises and related forms of mediation. In 
both of these a significant degree of change 
in various cardiac functions has been pro- 


duced. 


160 EDWARD B. BLANCHARD AND LARRY D. YOUNG 


Progressive Relaxation 


Jacobson (1939) reported on the effects on 
blood pressure of training in and subsequent 
use of his system of muscle relaxation. His 
conclusions, which tended to be more quali- 
tative than quantitative, were that there is a 
general relation between decreases in blood 
pressure and decreases in muscle activity as 
shown by the electromyogram and that train- 
ing in “progressive relaxation" results in 
greater decreases in electromyogram activity 
than self-induced relaxation without training. 
He presented blood pressure data on 14 nor- 
motensive subjects, 9 of whom had been 
trained in progressive relaxation, These data, 
both systolic and diastolic blood pressure, 
were collected after a 15-minute adaptation 
period during which the subjects rested in a 
supine position and again after 45 minutes of 
self-induced relaxation. Analyses of these data 
by the present authors revealed significant 
decreases in both systolic blood pressure 
(AX — 8.0 millimeters of mercury; Ё = 19.9, 
dj = 1/12, p < 01) and diastolic blood pres- 
sure (AX — 7.8 millimeters of mercury; F — 
11.5, df = 1/12, p<.01), but no differences 
between the trained and untrained subjects in 
degree of reduction achieved. It should be 
noted that the magnitude of change in blood 
pressure in this study exceeds that reported 
by Shapiro and his colleagues in any of their 
studies of normotensive subjects (see Table 
2) and compares favorably with the results 
reported by Brener and Kleinman (1970). 

In a second study, Jacobson (1940) re- 
ported decreases in heart rate associated with 
relaxation training. Heart rate was measured 
initially after a 15-minute adaptation period 
during which the subjects rested in a supine 
position and later after 45 minutes of pro- 
gressive relaxation, The average decrease in 
heart rate was 6.5 beats per minute. Analysis 
of this data by the present authors shows 
this change to be highly significant (= 5:50, 
р < 001). 

In a more recent study, Paul (1969) re- 
ported on physiological changes in college-age 
females who were treated for 30 minutes by 
either (a) a brief form of progressive relaxa- 
tion training, (b) hypnosis induction and sug- 
gestions of relaxation, or (c) seli-instructed 


relaxation. Heart rate data were collected 
during a 10-minute adaptation period and at 
the end of training. The progressive relaxa- 
tion training group showed significantly 
greater changes in heart rate than the other 
two groups with reductions of approximately 
8 beats per minute occurring in both sessions. 
It seems possible that Paul's subjects were not 
completely adapted when the initial heart 
rate recording was made which could ac- 
count for part of the magnitude of change. 

In a later report Paul and Trimble (1970) 
compared subjects given training in brief 
progressive relaxation either by tape recording 
or by a therapist present in the room. The 
latter condition led to significantly greater 
decreases in heart rate; however, the tape- 
recorded relaxation treatment did result in a 
group mean decrease in heart rate of approxi- 
mately 4 beats per minute which is compara- 
ble to the results reported by most other in- 
vestigators. 

It thus seems that training in deep muscle 
relaxation, even of an abbreviated form, can 
lead to decreases in heart rate comparable to 
those achieved by most of the studies in the 
biofeedback area. This finding leads us to 
speculate that the operant conditioning or self- 
control procedures are possibly only inefficient 
methods of teaching subjects to relax. 


Yoga 


Although many claims of phenomenal de- 
grees of self-control among Yogis had been 
made, few scientific data were collected ОП 
this subject until the 1950s when Wenger аш 
his associates (Wenger & Bagchi, 1961; 
Wenger, Bagchi, & Anand, 1961) made SYS 
tematic autonomic recordings from over 5; 
Yogis in India. In one study of three Yogis 
who claimed to be able to stop their heart, 
Wenger et al. (1961) found (a) absence " 
radial pulse, (5) absence of most heart gp 
(c) blood pressure radically increased, (e 
(d) no stoppage of the heart and, in fact, ^ 
very little slowing in heart rate. They sd 
cluded that these subjects were utilizing AE 
racic muscle interference with venous yu 
which led to a radical decrease in blood 7? 

А fourth Yogi claimed only to be ие а 
slow his heart rate and did so, demonstrat se 
increases in interbeat intervals from a 0 
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line of approximately 1.0 second to slightly 
over 2 seconds for several beats (equivalent 
to changes in heart rate from 60 beats per 
minute to 30 beats per minute. The authors 
concluded that no muscle interference was 
apparently used; rather, the alterations 
seemed to be due to vagal innervation. 

A similar example of voluntary cardiac 
arrest was reported by McClure (1959) in 
the case of a 44-year-old Caucasian male. 
This patient, who showed no cardiac irregu- 
larities, could voluntarily go into cardiac ar- 
rest for periods of approximately 5 seconds, 
increasing the interbeat interval of his heart 
rate from normally 1.0 second to over 5 
seconds, Although the patient did not know 
the mechanism involved, McClure concluded 
that it was due to vagal innervation and was 
entirely unrelated to respiration rate. 

Three systematic studies of the effects of 
Yoga exercises and meditation on cardiac 
function have recently been reported. Datey, 
Deshmukh, Dalvi, and Vinekar (1969) stud- 
ied the effects of training in the Yogic exer- 
cise called *Shavasan" on 47 hypertensive 
patients. The patients, whose original blood 
pressures were in the range 160-270/90-145, 
all received training in Shavasan for unspeci- 
fied lengths of time. The typical training 
Course was described as requiring 3 weeks of 
daily 30-minute sessions. Results were re- 
Ported for three groups of patients: (a) 10 
not on medication; (5) 22 whose hyperten- 
Sion was controlled by drugs; and (c) 15 
Whose hypertension was not adequately con- 
trolled by drugs. In all cases blood pressure 
Was reported as the diastolic pressure plus 
one-third of the pulse pressure. For those not 
on medication blood pressure was reduced in 
9 out of 10 cases with the average reduction 
being 27 millimeters of mercury (р < .05). In 
the second group the effects of training in 
Shavasan were evaluated in terms of the re- 
duction in patients’ drug dosage necessary to 
keep their blood pressure under control. In 13 
9f 22 patients, the drug dosage was decreased 
Y 33% or more, with an average reduction 
for those 13 patients of 68%. In the third 
Btoup the effects were evaluated in the same 
ashion and drug dosage Was reduced by at 
east 33% in 6 of 15 patients. This study 
Seems to show that a chronic course of train- 
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ing in yogic exercise had a clinically signifi- 
cant effect on a majority of the hypertensive 
patients studied, and would warrant investi- 
gation in a controlled-outcome study. 

In the first of two studies of similar design 
Wallace (1970) studied the physiological pa 
fects of “transcendental meditation,” as taught 
by the Maharishi Mahesh Yogi. Physiological 
responses of 15 “normal” college students, 
who had been practicing ‘transcendental 
meditation" for from 6 months to 3 years, 
were monitored both for 30 minutes prior to 
a 20-30 minute meditation period and during 
the latter period. In the five subjects for 
whom heart rate was measured, an average 
decrease of 5 beats per minute was observed. 
It was not stated, however, whether or not 
the magnitude of change was statistically sig- 
nificant. 

In a more elaborate study of similar design, 
Wallace, Benson, and Wilson (1971) moni- 
tored physiological responses of 36 subjects 
who had been practicing “transcendental 
meditation" for an average of 29 months. The 
principal results of comparisons between the 
meditation. period and the 30 minutes im- 
mediately preceding, during which subjects 
were resting comfortably, were significant 
changes in electroencephalogram and oxygen 
consumption, There was also a small but sig- 
nificant (p < .05) decrease in heart rate (AY 
= —.3 beats per minute), but no change in 
blood pressure. 

These studies, though not of the controlled- 
outcome format in which a “no-treatment” or 
“attention-placebo” group is monitored, are 
highly suggestive that Yoga and other East- 
ern forms of exercise-meditation training can 
have important clinical application in the field 
of cardiac self-control. Certainly the changes 
achieved are of sufficient magnitude to war- 
rant exploring this line of research as well as 
the ones discussed previously. 

CONCLUSIONS 

The self-control of several cardiac func- 
tions, acceleration, deceleration, and variabil- 
ity of heart rate and increases and decreases 
in blood pressure, has been amply demon- 
strated in normal subjects. However, with few 


notable exceptions the magnitude of change 
obtained has been small, the duration for 
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which control was manifested has been brief, 
and the generalization of the changes obtained 
in the experimental chamber rarely demon- 
Strated. Future research in this area must 
follow two directions if the promise it holds 
t is to be fulfilled: (a) there is now a need 
chronic studies involving many trials to 
ain large-scale changes; (5) there is also 
need for more direct application of this 
. Work to clinical populations. Finally, as was 

- pointed out in the final section of this paper, 
` alternative modes of obtaining changes in 
cardiac functioning such as training in pro- 
gressive relaxation, transcendental meditation, 
or various Yoga exercises, may prove to be 
more fruitful approaches to obtaining clini- 
cally significant changes than the operant- 
conditioning~feedback techniques reviewed in 
the main body of this paper. Only compara- 
tive outcome studies, conducted on a chronic 
basis, can answer this question, but certainly 
it is deserving of an answer, 
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This note is a reply to Rawlings’ criticisms of 
The method of analysis proposed by Rawlings а 
equivalent to our method, which he criticized as incorrect. 


authors. 
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a previous article by the present 


inherent in the computational procedures proposed by Rawlings for more complex 


y 
wlings (1972) recently criticized a method 
suggested by the present authors for the anal- 
ysis of variance of data from nonorthogonal 
designs (Overall & Spiegel, 1969). Substantial 
misconceptions are apparent in his criticisms, 
since a numerical example shows that the 
method which he proposed as appropriate is 
precisely equivalent to our method that he 
criticized as incorrect. Our previous article 
compared three different methods used by 
psychologists for the analysis of data from non- 
orthogonal experimental designs, and it pointed 
out that two of the methods do not provide 
tests of main effects as main effects are usually 
conceived in analysis of variance. Rawlings’ 
criticisms overlook this purpose and imply that 
we presented all three methods as appropriate 
generalizations of the conventional analysis of 
variance for orthogonal designs. This will surely 
mislead readers who fail to refer to our article. 
Our purpose in the previous article was to show 
that the alternative methods test different 
hypotheses. 

After detailed comparisons, we previously 
concluded that our Method 2 is the appropriate 
generalization of conventional analysis of 
variance. It yields results identical to those 
provided by several Statisticians as examples 
of nonorthogonal analysis of variance (Rao, 
1965; Snedecor & Cochran, 1967). Neverthe- 
less, Rawlings (1972) states that although the 
method does “provide tests of hypotheses, 
these tests are not tests of main effects [p. 
313]" Ironically, he then recommends a 


method of patching together several different 
ЕЕ ан 

1 Requests for reprints should be 
Overall, Department of Neurology 


University of Texas Medical Branch, 
77550. 


sent to John E. 
and Psychiatry, 
Galveston, Texas 


designs, and programs for accomplishing the analyses are considered. 


computer programs to obtain an analysis di- 
rectly equivalent to our Method 2. 

A numerical example demonstrates 
equivalence of Rawlings’ analysis and the 
analysis previously proposed by us as the most 
appropriate generalization of orthogonal anal- 
ysis of variance. In Table 1 are data for a two- 
way analysis of variance with disproportionate 
cell frequencies. Our previous article presented 
results obtained from analyses of these same 
data by three different methods. The method 
which we concluded to be most directly anal- 
ogous to the conventional analysis of variance 
was Method 2. Rawlings’ proposals for non- 
orthogonal analysis of variance can be shown 
to be precisely equivalent to our Method 2, 
at least for the case that he chose. 

First, Rawlings proposes to calculate the 
error sum of squares by summing the squared 
deviation of observed scores about cell means: 


Sok = >is 25 (у) = Ry. 


The means for the 12 cells in Table 1 2 
calculated, and the squared deviations of zs 
served scores were then summed to yield th 
following: 


the 


SSy = (61 — 62)? + (73 — 62)? 
To +. (91 — 88,7)? = 2,269. 


x Е ЕЁ E » value 
We previously obtained exactly this same V: 


as a residual about the full regression а! 
The BMDOSV computer program ат fe 
1968) was employed, as recommende di 
Rawlings, to obtain appropriate sums re 
squares for testing main effects. To obtain 


e 
; ies s, th 
appropriate main effect sums of € the 
i Я saiae. 3i 

interaction terms were nol included 1 ra 


: m ta 
design matrix, The sums of squares obti 
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TABLE 1 


Dara ror Two-Way ANALYsIS OF VARIANCE 


Effect 
Effect — س‎ 
А. Аз Аз 
61 42 96 
Bı 73 53 81 
52 | 92 
| 
| 79 37 45 
В, 05 32 37 
81 50 | 
43 87 75 
By 35 | 81 59 
65 
56 72 98 
25 84 77 
m 19 91 
35 


from the Dixon program were 
SS4 = 2,729.6 
and 
5,5; = 779.0. 


These sums of squares for main effects are the 
Same as we previously obtained in Method 2 
analysis of these same data. The composite 
Sum of squares due to regression on both main 
effects can also be obtained directly from the 
Dixon program by inclusion of an appropriate 
"hypothesis сага.” It was found to be 


SSy = N'Y = 3,722.8. 
This latter sum of squares is Rawlings’ их 
With y = 5 degrees of freedom. It is required 


? The reader should be aware that the sum of squares 
due to regression on all main effects simultaneously can- 
not be obtained by simply summing the separate main 
effect sums of squares in a nonorthogonal design. ‘The 
Composite sum of squares for regression on all main 
effects can be obtained as the difference between the 
total sum of squares and the residual from the main 
effects model, 55и = SSr — SSR. _ КЕ 

è Two typographical errors occur in our original ar- 
ticle. The design matrix presented in Table 2 (p. 314) 
Contains a transposition of 0 and 1 in the final two 
entries in the row containing score 37. This transposition 
Was not present in our original matrix, and the analysis 
15 correct as presented. The error € for al three 
meth, are i ical: thus the extra А? in the error 
E are identical; tl heces: ua 

for Method 1 (p. 316) р 
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TABLE 2 


SUMMARY OF ANALYSIS OF VARIANCE OBTAINED IN 
THREE STEPS FOLLOWING RAWLINGS’ 
RECOMMENDED PROCEDURE 


Source | SS | df | MS F 
X | 2,729.6 | 2 12.0 
B 779.0 3 2.3 
AB | 9,5601 6 14.0 
Error | 2,269.9 20 

Total | 15,5529 


to compute the sum of squares for interaction 
in the manner proposed by Rawlings: 


55ав = FY — DXF — 556 
= SSr — SSu — 856. 


"Thus, in this example, the interactions sum of 
squares is 


— 2,270.0 = 9,560.1. 


The complete analysis of variance that was 
pieced together by calculating the error sum of 
squares from deviations about cell means, cal- 
culating the sums of squares for main effects 
using BMDOSV, and calculating the interac- 
tion sum of squares as a residual is shown in 
Table 2. These results are identical (within the 
margin of rounding errors) to those previously 
presented for analysis of these same data by 
Method 2 (Overall & Spiegel, 1969).* 

It is of interest to examine some potential 
difficulties in the use of his recommendations 
lor analyses involving more complex multi- 
classification designs. First, the calculation of 
an error term directly from deviations about 
cell means is not practical. In а 14-way fac- 
torial design, for example, many cells may be 
empty or may contain only a single observa- 
tion, and thus deviations from the cell means 
would be undefined. It is frequently difficult 
even to define all of the possible cells in a com- 
plex multiclassification design. The correct sum 


4 Psychologists are familiar with the problem of 
interpreting main effects in the presence of significant 
interactions. In “fixed effect" factorial designs, one 
usually considers main effects only if interactions in- 
volving those factors are not significant (Snedecor & 
Cochran, 1967). This problem is no different in non- 
orthogonal analysis of variance. 
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of squares for error can be obtained as a re- 
sidual in the complete regression analysis. It 
is correct even if the design contains empty 
cells, provided that reasonable assumptions can 
be made about the absence of certain higher 
order interactions. 

While the conception advanced by Rawlings 
of a single simple interaction as representing 
“lack of fit” to the main effect model is per- 
fectly acceptable, a very real problem arises in 
more complex multiclassification designs. The 
section of Scheffé (1959, pp. 112-116) cited by 
Rawlings is concerned specifically with a simple 
two-way design. 

Suppose one were to undertake a 10-way 
factorial instead. There would be 45 simple 
interactions plus a larger number of higher 
order interactions, It would be hazardous to 
lump together all interaction sums of squares 
to test the composite deviation from the main 
effect model against the within-cells error, even 
though such a test may be valid under an ap- 
propriately specified model. The presence of as 
many as a dozen significant interactions could 
easily be masked by the 1,000-plus zero inter- 
actions with which they are combined. In com- 
plex analyses, it seems important to include in 
the model interaction terms that are hypothe- 
sized to be present, to break out the parate 
sums of squares, and to test the specific inter- 
action effects separately. Interactions assumed 
à priori to be nonexistent can be pooled into the 
error term. Interactions that cannot be as- 
sumed a priori to be absent should be included 
in the model. 

Implicit in Rawlings’ proposed analvsis is 
the testing of a composite interaction sum of 
Squares which, if not significant, is pooled into 
the residual error for testing main effects Due 
to the low power of the composite test against 
specific interaction effects, such procedure 
Should not be recommended for complex multi- 
classification designs. The pros and cons of 
pooling have been discussed frequently with 
regard to orthogonal analysis of variance 
(Winer, 1962, p. 202), and the general problem 
is no different in the nonorthogonal case. The 
present writers do not favor pooling of inter- 
action sums of squares merely because of failure 
to find them statistically significant, but this is 
perhaps only a conservative preference on our 
Part. In any event, the question of pooling has 
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no unique relevance in consideration of non- 
orthogonal analysis of variance.” 

In concluding this comment, a brief con- 
sideration of computer programs may be In 
order. The BMDOSV program recommended 
by Rawlings for calculating sums of squares for 
main effects is adequate for the entire analysis, 
provided one recognizes that it accomplishes a 
general linear model analysis that is precisely 
Method 1 of our previous article. As pointed 
out by Rawlings, Method 1 provides correct 
tests of interaction effects, and we have added 
that a correct sum of squares for error can also 
be obtained directly from the Method 1 anal- 
s. If one wishes to accomplish a Method | 
analysis using BM DOSV or any other Methoc 
1 program, two passes are required, First, 2 
complete model including main effects and p 
teractions is specified. This yields sums © 
squares for interactions and for residual error. 
Next, a reduced model involving only man 
effects is specified. This yields sums of squares 
for main effects. If Rawlings were going to 
recommend use of BMDOSV for a portion be 
the analysis, it is puzzling why he did not rec” 
ommend its use for all of the analysts. 
course, if one desires a Method 2 analy | 
which Rawlings accepts as the appropriate 
method, it would be nice to have a compute! 
program which produces that analysis directs 
Such a program has been provided by Over 
and Klett (1972, pp. 458 468). 


Rawlings 
de con” 
tation 
action: 

vious 
d t0 


In his accompanying reply to this note, 
(1972) has modified his original proposal to inclu 
sideration of pooled error estimates and interpr 
of main effects in the presence of significant inter? 
These considerations were not included in his Dti 
proposal of an analysis which we have demonstra ors 
be equivalent to our Method 2. He also grossly "i cd 2. 
resents our recommendations with respect to Meth ate 
We did not recommend Method 2 as most appre 
“when results suggest departure from additivity 
actually stated that “Method 2 is appropriat 
one hopes to account for systematic variation ! 
of simple additive main effects but wishes ps ast non- 
significance of interactions as a safeguard agains 
additivity [p. 319]." 


e where 
n terms 
the 


test 


REFERENC d 
Dixox, W. J. (BMDOSV) In, Biomedical computer P 
grams. Los Angeles: University of California 
1968. | 
GRAYBILL, F. A. An introduction to linear 
models. New York: McGraw-Hill, 1961. 


КОЛ 


COMMENT ON RAWLINGS 67 


OVERALL, J. E, & Klett, C. J. Applied multivariate 
analysis. New York: McGraw-Hill, 1972. 

OVERALL, J. E., & SPIEGEL, D. К. Concerning least 
squares analysis of experimental data. Psychological 
Bulletin, 1969, 72, 311-322. 

Rao, C. К. Linear statistical inference and its applica- 
tions. New York: Wiley, 1965. 

RawriNGs, R. R., Jk. Note on nonorthogonal analysis 
of variance, Psychological Bulletin, 1972, 77, 373-374. 


SCHEFFÊ, Н. The analysis of variance. New York: Wiley, 
1959. 

SNEDECOR, С. W., & COCHRAN, W. G. Statistical 
methods. (6th ed.) Ames: Iowa State University 
Press, 1967. 

WINER, B. J. Statistical principles in experimental design. 
New York: McGraw-Hill, 1962. 


(Early publication received June 15, 1972) 


Psychological Bulletin 


1973, Vol. 79, No. 3, 168-169 


COMMENTS ON 


THE OVERALL AND SPIEGEL PAPER 


ROBERT R. RAWLINGS, JR. 


National Institute of Mental Health, Rock: їйє, Maryland 


This note is a reply to the 
that the method proposed 
Rawlings are not equivalent. 


The purpose of this note is the clarification of 
; certain erroneous interpretations concerning 

onorthogonal analysis of variance which 
ear in Overall and Spiegel (1973). In part, 
this note I demonstrate that their Method 
2 is not equivalent to the method described by 
Rawlings (1972) referred to here as Method 1'. 

First, I discuss in terms of hypothesis test- 
ing, the three methods for performing the 
analysis of variance described by Overall and 
Spiegel (1969). The model used in the discus- 
sion is the two-factor interaction model 


Yük = u+ а + B; 4 yi Ck 

in which all cells are filled, We denote the pop- 
ulation cell means by u; and the cell frequencies 
by ng, i= 1, nnd :کے .وا اک‎ AS {б Well 
known the test for ап A effect in Method 1 is a 
test of the hypothesis that the A; = р; — Ā.. 
are equal for all 7, or equivalently it tests the 
hypothesis that a;* = æ; — а + ӯ. — 7.. are 
equal for all i. This hypothesis is not data de- 
pendent and it is based upon the traditional 
definition of main effects Aj. 

From pages 304 and 310 of Searle (1971) it 
is observed that the Method 2 test for an A 
effect is a test of the hypothesis that for all j 


1 1 А 
"gs, 
2 [7 = > 024; ;] = 0 


j=1 Ny 
hypothesis that for 


7=1 


or equivalently a test of the 
all i 


J Й 
поп\ | g ni 
-E(Et9.. E = ey, 
tæi Муз Aj ne nj 
I 
1 nin; 
-AEX[(SH o 
йа уа ny 
as 
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article by Overall and Spiegel. 
by Overall and Spiegel 


This note demonstrates 
and the method proposed by 


From page 307 of Searle (1971) it is ob- 
served that the Method 3 test for an A effect 
is a test of the hypothesis that the 

1 J 
5 пошу 
Hi. j=l 


are equal for all i, or equivalently that the 


1 
a; 
H 


f 


1 
У 1(8; + Yu) 
j=l 


are equal for all i. 

In Overall and Spiegel (1969) it is recom- 
mended that Method 2 is the most appropri- 
ate method of analysis when results suggest 
departure from the additive model, However, 
since the hypothesis corresponding to Method 2 
is complex and data dependent, it is rather 
doubtful that it is useful. Also, in the Overall 
and Spiegel article it is suggested that Method 
3 may be more appropriate in testing a hier- 
archy of effects. But since Method 3 has a 
hypothesis that is data dependent and depen- 
dent upon the B effect and interactions, it is 
doubtful that this method is very useful. 

On pages 316-318 Searle (1971) discusses 
the difficulties associated with these hypothe- 
ses. When interactions are present, it appears 
to this author that Method 1 at least possesses 
а more meaningful interpretation as in Schellé 
(1959) (the effects of the levels of one factor 
аге averaged over the levels of the other 
factor). 

I refer to the test for ап A effect using only 
the noninteraction model as Method 1’. When 
there are no interactions it can be shown that 
Methods 1 and 2 are equivalent to the hy- 

sis that ; u аге al, which is the 
pothesis that all a; are equal, i bes 
same hypothesis tested Ьу Method 1 : It ү 
also be shown that when we have no р 
tions and proportionate cell frequencies, ise 
Method 3 tests the same hypothesis. a 
Methods 1, 2, and 1’ test the same hypothes 
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when there are no interactions, one should use 
that statistic which provides the maximum 
power for the test of the hypothesis. In the 
following, I discuss Method 1' as an improve- 
ment of Method 1 for the test of main effects. 
Also, I describe the extension of Method 1' to 
designs with more than two factors. In Rawl- 
ings (1972) it is demonstrated mathematically 
that for large V, Method 1’ will always have 
larger power than Method 1 in the nonorthog- 
onal designs (the powers are equal їп the 
orthogonal case). This fact was the basis for 
my suggestion to use Method 1’ rather than 
Method 1. Mattsson and Rawlings? investi- 
gated the effects of different nonorthogonal 
designs on the power of the above statistical 
tests, and the numerical experiments con- 
sistently support the above conclusion. By 
similar arguments one can conclude that for 
an analysis with more than two factors one 
should retain only the significant interactions 
in the model when testing for any "appro- 
ı4 7 priate” main effects if one is to increase the 
power of the main effects tests. 

If there are more than two factors, then 
Method 1 is described as follows: (a) Test for 
interactions and main effects using the com- 
plete interaction model. For those factors 
possessing significant interactions with other 
factors the main effects tests can be interpreted 
as above. If interactions are unlikely then one 
E eer 

2N. B. Mattson and R. R. Rawlings, Jr. Problems 
of power in nonorthogonal models. (Tech. Rep. CSB- 


SAS-2) Washington, D. C.: National Institute of Men- 
tal Health, 1972, 
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can perform a lack-of-fit test instead. (b) Test 
the remaining factors for main effects using a 
model containing only those interactions which 
were found to be significant in a above. In 
order to maintain an experimentwise signifi- 
cance level a, one should split up a between a 
and 5, say a1 and os such that ai + a» = a as 
in Miller (1966). In brief, Method 1 is simply 
the use of Method 1 on two different interac- 
tion models. 

If Method 2 is to be proposed as an improve- 
ment over Method 1 then the following prob- 
lems remain to be resolved: (a) If there are 
significant interactions in the general N factor- — 
model, but there are no interactions involving 
Factor А, then show that the test of an A effect 
reduces to a test of the same hypothesis as that 
tested by Method 1. (5) Demonstrate mathe- 
matically that Method 2 has larger power than 
Method 1 for detection of main effects, at least 
under certain conditions, 
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COMMENT ON JOE'S "COMMENT ON 


OVERALL AND SPIEGEL'S 


'LEAST SQUARES ANALYSIS OF EXPERIMENTAL DATA' " 


I. LEON SMITH ! 


University of Cincinnati 


The general linear model is discussed with respect to the conditions of orthogonality 
and equal subjects per cell in an analysis of variance application. 


Joe’s (1971) discussion and attempted clari- 
fication of the Overall and Spiegel (1969) 
article contains a basic error. He suggested 
that in the special case of equal subjects per 
cell, T'A'AT will be a superdiagonal matrix. 
However, his own illustration of a two-way 
design with two levels of A and three levels of 
B with equal rather than unequal subjects per 
cell does not support this statement. Consider 
his example with two subjects per cell: 


[Xii 1 1 0 L uU 
Xen I i 0 1 Q 
Ха 11 0 0 1 

Xa 112001 

Mins i 10 0 9 

- Vais 119 9 D 
im Xin AT7l101120 
Xon 10110 

Хз 1102 O 1 

1l 9 l 9 1 

10100 

1010 0 


Since he wishes to form the dummy vari- 
ables, (a1 — аз), (81 — Вз), (8: — Bs), (a1 — оз) 
X (Bı — Bs), and (о — as) (Bs — Вз) from A, 


the appropriate T matrix is: 


o} [0 


0 0 0 0—1 1 
O X 0 Qd 0 
0 0 0 0 0-1 


B 
1 2 3 
1 Хап Хит Vans 
Von 
1 2 Via | 
Voor 


In terms of the general linear model for this 
design, 


0100000 s 
010000 o| 
0010000 аз 
001 0 00 0 B 
1001000 " 
10010200 m" 
oowoo too = oii |" 6I 
0000100 ab» 
0000010 abs 
0000010 n 
10 0000 1 ab» 
10000 01 abs 


The square symmetric T'4^4 T matrix for this 
example is: 


12 0 0 9 Q 0 
12 Q0 9 O O 


8 4 0 0 [3] 
ё 0 u^ 
8 4 
8 


The matrix, of course, is not diagonal, even 1 
the case of equal subjects per cell. 

1 Requests for reprints should be sent to 
Smith, who is now at Curriculum Research D ad 
velopment Center in Mental Retardation, d 
Graduate School of Humanities and Social A. Aw 
Yeshiva University, 55 Fifth Avenue, New York, + 
York 10003. 
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The reason for this rests in the failure to form orthogonal dummy variables or contrasts 


from A. In the example, -1T equals: 


(oi = аз) (o f a») 
u (аа) (8ı 8:) (8: — 8:) (81 — 8s) (8: — Bs) 
T 1 1 0 1 0 
1 1 1 0 1 0 
1 1 0 1 0 1 
1 1 0 1 0 1 
1 1 —1 —1 -1 —1 
1 1 —1 —1 —1 —1 
1 —1 1 0 —1 0 * BJ 
1 =1 1 0 -1 0 
1 —1 0 1 0 —1 
1 e 0 1 0 —1 
1 eui ES -1 1 1 
1 ex —1 =i 1 1 


In order for contrasts to be orthogonal, the 
columns of 17 must be independent—the sum 
of products of elements of any one column 
times another must be zero. In the case of the 
two contrasts for the B effect, as well as the 
interactions, the product is 4, which is iden- 
tical to the offdiagonal elements in Matrix 3. 
Thus, 774/47 will be a diagonal matrix when 
there are equal subjects per cell, but only when 
the contrasts or dummy variables are orthog- 
onal. Since Joe selected nonorthogonal con- 


trasts from А, his Equation 13 will not yield 
the appropriate diagonal elements of 7’A’AT 
when both of the above conditions are met. 
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JUDGMENTS AND EXPLANATIONS AS CRITERIA FOR 
THE PRESENCE OF COGNITIVE STRUCTURES 


CHARLES J. BRAINERD ! 


University of Alberta 


The current methodological disagreement. over whether subjects’ judgments or 
their explanations on Piagetian tasks should be viewed as the minimum neces- 
sary evidence for the presence of underlying cognitive structures is reviewed 
and resolved. The Principal features of Piaget’s theoretical model of cognitive 
Structure are summarized as a means of resolving the disagreement. Two 
conclusions are authorized by this summary: (a) Explanations constitute in- 


appropriate bases for 


inferring the presence/absence of cognitive structure 


because they are subject to at least two sources of systematic Type II error. 
(b) Judgments constitute appropriate bases for inferring the Presence/absence 
of cognitive structure, because—from the standpoint of Piagetian theory—they 
are not subject to any known source of systematic error. 


In recent years, Piaget (e.g. 1970a, 
1970b) has reemphasized a fact about his 
theory of intellectual development which has 
received notably insufficient attention in the 
Piaget replication literature. The theory is— 
in its most basic sense—an account of the 
ontogenesis of a particular theoretical con- 
struct, cognitive structure. By implication, 
all of the now-familiar Piagetian dependent 
variables — (e.g., conservation, transitivity, 
class-inclusion), mental processes (e.g., in- 
version, reciprocity), and laws (e.g.. equili- 
bration) serve but two broad ends, to deter- 
mine the presence and the nature of cognitive 
Structures. It is with the former of these two 
matters that the present paper is concerned. 

The number of replications of the widely 
scattered reports of Piaget’s original studies 
of intellectual development grew steadily 
during the past decade (cf. Brainerd & Allen, 
1971a; Flavell, 1963, 1970; Hooper, Gold- 
man, Storck, & Burke, 1971, for reviews). 
Perhaps the most noteworthy feature of this 
literature is the series of objections, some 
trivial, but most justified, that have been 
raised regarding Piaget’s “clinical method.” 
From the time Braine’s doctoral dissertation 
and a subsequent monograph based on it 
(Braine, 1959) appeared until the present, 
these objections have fallen into two cate- 


Sories, "stimulus" criticisms and "response" 
сэн. Жырны 

1 Requests for reprints should be sent to Charles 
J. Brainerd, Department of Psychology, University 
of Alberta, Edmonton 7, Alberta, Canada. 
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criticisms. "The stimulus criticisms (e.g., 
Braine, 1959, 1962; Braine & Shanks, 1965a, 
1965b; Hall & Kingsley, 1968; Smedslund, 
1963, 1969) concern methods of task presen- 
tation and they include the usual sources of 
Type II error in studies of children's reason- 
ing: inattentiveness, forgetting, misunder- 
standing instructions, etc, The response criti- 
cisms concern the specific behaviors which 
are used to infer the presence of cognitive 
Structures, that is, how shall structures be 
known? To date, the principal point of con- 
tention on the response side has been whether 
children's judgments or explanations of these 
judgments are the minimum necessary evi- 
dence that the structure presumed to underlie 
the solution of some given task is in fact 
present. А 

The purposes of this paper are to provide 
both a brief, selective review and, more im- 
portant, a resolution of the second source of 
controversy, the response criticisms, We shall 
see that reviews of both the salient features 
of Piaget’s model of cognitive structure and 
currently available classificatory schemes for 
explanations permit a resolution in favor of 
judgment responses, 


с Y 
$СОРЕ or THE PROBLEM AND PRELIMINAR 
DEFINITIONS 


Since this paper is selective in the pae 
of specific publications cited, some prend 
remarks are in order concerning the us di 
of phenomena to which the judgment/expla 


CRITERIA FOR COGNITIVE STRUCTURES 


nation controversy applies and the impor- 
tance of its resolution. Also, it is advisable 
to be clear at the outset about what is meant 
by a judgment and an explanation in the 
context of Piagetian tasks. 


Scope 


Although the range of dependent variables 
to be considered is restricted, the methodo- 
logical controversy which they illustrate arises 
in all studies involving tasks designed to 
assess Piagetian cognitive structures. Thus, 
all of the numerous tasks which Piaget has 
employed in his studies of the various levels 
of cognitive structuration are relevant. For 
present purposes, however, the range will be 
limited to the standard (Flavell, 1970, p. 
1003) methods of determining the presence 
of the class and relational structures of 
the concrete-operational period: conservation 
problems, seriation problems, class-inclusion 
problems. 

Regarding the need to settle this contro- 
versy, it is not, as some might suppose, 
necessary to resolve it for the sake of estab- 
lishing precise age norms for the appearance 
of Piagetian cognitive structures. Although it 
was in connection with age norms that the 
controversy first appeared, age norms are 
trivial issues from the standpoint of Piagetian 
theory because the theory leaves wide latitude 
regarding them. Instead, it is the hypothe- 
sized correspondences and sequences between 
structures which are the crucial empirical pre- 
dictions, The theory leaves no latitude in 
these matters, Given two cognitive structures 
A and B, either they are acquired in strict 
synchrony or one invariably precedes the 
other. It is from the perspective of a need 
for further research on correspondences and 
sequences in the acquisition of cognitive 
structures that the judgment/explanation is 
considered herein. We simply cannot hope to 
get on with the business of evaluating spe- 
cific predictions until a consensus on response 


criteria is reached. 


Definitions 


Consider the standard Piagetian tasks for 
assessing the presence of concrete-operational 
Structures: (a) conservation problems, (5) 
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three-term  seriation problems, (c) class- 
inclusion problems. In а, the child is shown 
two objects that are perceptually identical 
and equivalent with respect to some quanti- 
tative property (e.g., mass). One of the ob- 
jects is altered perceptually but not quanti- 
tatively. The experimenter then determines 
whether or not the child understands that 
quantitative equivalence is conserved across 
the perceptual transformation. In 5, the child 
is shown three objects (A, B, C) which differ 
on some quantitative dimension (e.g., length). 
The experimenter demonstrates the relation- 
ship between the members of two of the 
three possible pairs (e.g., A < B, B < C) and 
then determines whether or not the child can 
deduce the relationship between the memb 
of the third pair. In c, the child is shor 
two or more numerically unequal sets of ob- 
jects (e.g., six horses and three cows). The 
experimenter determines whether or not the 
child infers that the number of elements in 
the superordinate set (animals) must be 
greater than for either subordinate set. 
Traditionally, inferences about the presence 
of the cognitive structures which the pre- 
ceding tasks presumably assess have been 
based on one or both of two sorts of re- 
sponses, judgments and explanations. A judg- 
ment is generated as follows: A dichotomous 
question is posed about some quantitative 
relation which holds between the objects 
under consideration. The question is dichoto- 
mous in that the domain of possible responses 
is limited to two alternatives—either the ob- 
jects are the same or they are different. To 
illustrate, in the conservation problem out- 
lined above the child would be asked to judge 
whether or not the perceptually dissimilar 
objects were quantitatively equivalent. Since 
judgment responses are a dichotomous ran- 
dom variable, many possible response modes, 
in addition to verbalization, can be employed 
(e.g., cf. Beilin, 1965, 1968). An explanation 
is generated as follows: Following dichoto- 
mous judgments, the experimenter poses ques- 
tions of the form “Why?” “How do you 
know that?" Explanations obviously restrict 
the domain of possible response modes to 


one—verbalization. 
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ORIGINS OF THE CONTROVERSY AND 
Basic Issues 


In 1959, Braine published a monograph 
ostensibly devoted to replicating Piaget, 
Inhelder, and Szeminska's (1960) studies 
of two concepts, transitivity (Problem b 


above) and discrimination of ordinal position. 


Кару ài measurement techniques were radi- 


y p from those reported by Piaget 

al. In particular, extraneous verbal factors 
from both the stimulus side (by employing 
a reinforced pretraining procedure) and the 
response side (by defining satisfactory per- 
formance in terms of seeking hidden candy) 
were eliminated, Braine's innovations might 
have gone unchallenged if it had not been 
for the fact that his data were at odds with 
Piagets in one important respect. While 
Piaget found that transitivity and ordinality 
were acquired between the ages of 7 and 8, 
Braine reported that they were acquired 2 
years earlier, Braine concluded that such a 
large discrepancy seemed to undermine the 
credibility of Piagetian theory. 

Braine's findings soon provoked a lively 
exchange with Smedslund on “transitivity 
criteria." Throughout the exchange, both 
authors focused largely on stimulus issues. 
Presentation methods were scrutinized me- 
ticulously: Smedslund (1963) enumerated 
three possible sources of Type I error and 
three possible sources of Type II error in 
“psychological diagnostics” and he argued 
that one of the sources of Type I error 
(solutions by irrelevant hypotheses) prob- 
ably accounted for Braine's data. Smedslund 
presented replication data of his own that 
supported Piaget's norms. Braine (1964) re- 
plied that Smedslund's criticisms were “far 
fetched” in that although solutions by ir- 
relevant hypotheses were possible, there was 
no evidence {ог their actual occurrence. 
Braine presented fresh data which failed to 
reveal such errors and which replicated his 
original findings. In a rejoinder, Smedslund 
(1965) advanced the additional charge of 
“pseudomeasurement,”? Employing pseudo- 
measurement as a point of departure, Smeds- 
lund (1966) reported further data in support 
of Piaget’s norms, During the s 


ame interim, 
Braine applied his methods to 


conservation 
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problems (Braine & Shanks, 1965a, 1965b) 
and found that conservation concepts also 
appeared 2 years earlier than Piaget had 
reported, 

It remained for Gruen (1966) to observe 
that the discrepancy between Braine's and 
Smedslund's data might be due in part to 
the different response criteria that each em- 
ployed: Braine's criterion for the presence of 
transitivity was judgments only, while Smeds- 
lund's (and Piaget's) criterion was judgments 
plus adequate explanations. Gruen proceeded 
to reanalvze the data of his doctoral disserta- 
tion (Gruen, 1965), which was concerned 
with number conservation, and he found, in 
line with his hypothesis, that more subjects 
were inferred to possess conservation when 
the judgment criterion was applied than when 
the explanation criterion was applied. On the 
basis of this finding, Gruen concluded that 
the judgment criterion is more consonant with 
non-Piagetian theories of cognitive develop- 
ment (e.g, Bruner, 1964), while the expla- 
nation criterion is more consonant with 
Piagetian theory. Subsequent research has 
either directly (Roodin & Gruen, 1970) or 
indirectly (Brainerd, 1971a, 1971b) sup- 
ported the proposition that the judgment 
criterion is the less stringent of the two. 

Thus, we come to the crux of the methodo- 
logical issue which confronts us: What infer- 
ences are to be made about subjects who 
provide correct judgments on Piagetian 
tasks but not satisfactory explanations? By 
Braine's criterion, one either must infer that 
the relevant cognitive structure is present or 
commit Type II errors. By Smedslund's cri- 
terion, one either must infer that structure 
is absent or commit at least some Type I 
errors. In the two sections which follow, we 
Shall review those points which can be 
teased out of Braine's and Smedslund's many 
"stimulus" arguments in support of their 
respective response criteria. 


Braine’s Position and Its Insu fficiencies 


Although the judgment criterion is associ- 
ated with Braine's name herein, many other 
investigators also have employed this criterion 
almost exclusively (e.g, Bever, Mehler, & 
Epstein, 1968; Calhoun, 1971; Mehler & 
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Bever, 1967). Hence, the association is simply 
a matter of historical precedent. 

As it turns out, all but one oí the various 
methodological arguments which Braine has 
advanced are strictly peripheral to issue 
under consideration because they pertain only 
to stimulus matters. The argument which is 
relevant appeared first in Braine (1959): 


“The reason why Piaget’s results differ from those 
reported here is not simply that there is a differ- 
ence in experimental procedures, but is, the writer 
believes, because in designing his experiments Piaget 
fails to eliminate important variables which are not 
involved in the definition of the processes he sets 
out Lo investigate |p. 16, italics added ]." 


Later, Braine (1962) offered illustrations of 
the sort of response criteria which were 
thought to follow from this argument. Bruner, 
Goodnow, and Austin’s (1956) concept for- 
mation “strategies” were cited as cases in 
point: Bruner et al. first recorded dichoto- 
mous judgments on a series of simple con- 
cept identification tasks and then searched 
the responses inductively to determine 
whether or not there were any reliable pat- 
terns, Patterns thus identified were viewed 
as underlying processes, 

Braine’s argument and illustration clearly 
constitute a less than sufficient rationale 
for employing a judgment criterion with 
Piagetian tasks. It is charged that Piaget’s 
methods are inconsistent with the "definition 
of the processes he sets out to investigate,” 
but no direct evidence that Braine’s meth- 
ods provide a better fit with the definition 
is offered. This yields a rather paradoxical 
situation: If one wishes to argue that 
Piaget’s methods do not square with his 
theory, it would seem imperative that one 
provide both a scrupulously precise synopsis 
of the theory and an equally precise state- 
ment of how one’s own methods are more 
clearly authorized. For some unknown reason, 
Braine has done neither of these things; 
perhaps this is one more example of Gordon 
Allport’s old principle of functional autonomy 
of methodology in psychological writing. In 
any event, the failure both to state the theory 
of cognitive structures precisely and to de- 
duce the judgment criterion from the theory 
Makes the quoted argument appear to be 


175 


little more than a standard plea for better 
operational definitions. 


Smedslund’s Position and Its Insu fficiencies 


The association of Smedslund’s name with 
the explanation criterion also is largely a 
matter of historical precedent. Other advo- 
cates of this criterion, notably, some of 
Piaget’s chief collaborators (Inhelder, Bovet, 
Sinclair, & Smock, 1966; Inhelder & Sinclair, 
1969) have been even more vocal than 
Smedslund in their contention that “special 
attention should be paid to the child's justi- 
fication of his answers [Inhelder & Sinclair, 
1969, p. 5].” Nevertheless, it is Smedslund’s 
methodological tracts which purport to supply 
the rationale for the criterion. 

As noted above, Smedslund (1963, 1969) 
has enumerated three sources of Type I 
errors encountered with Piagetian tasks. 
These are guesses, direct perception, and ir- 
relevant hypotheses. Since the first two can 
easily be eliminated by certain refinements 
in presentation techniques (cf, Smedslund, 
1969), only the third source need concern us. 
Irrelevant hypotheses refers to the fact that, 
occasionally, subjects fix on irrelevant hunches 
which permit correct judgments but zot cor- 
rect explanations. In Problem b above, for 
example, the subject might label objects that 
are presented first “little.” Subsequently, 
when asked which of the three objects is 
least, the subject judges that A is “because it 
came first.” Hence, presentation order forms 
an irrelevant basis for correct judgment. 

Although irrelevant hypotheses have been 
invoked to justify the explanation criterion, 


? Another argument which Smedslund occasionally 
has employed to justify the explanation criterion is 
that the subject must prove that he sees the logical 
necessity of a given Piagetian concept. The author 
does not deal directly with this argument in the 
present paper, because it simply does not apply to 
the entire range of Piagetian concepts—not even to 
the three to which this paper is restricted: Many 
Piagetian tasks involve only approximate physical 
properties of objects about which there is no logical 
necessity at all. This is especially true of conserva- 
tion problems (cf. Piaget, 1961) whose "logic" is 
disproven in modern relativity theory. By implica- 
tion, Smedslund's argument neither has been nor can 
be deduced from the theory—which acknowledges a 
difference between logical and physical experience 
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they fall short of being an adequate rationale 
for two basic reasons. First, they—like the 
other sources of Type I error—can be re- 
solved by stimulus refinements; studies by 
Murray and Youniss (1968) and by Youniss 
and Murray (1970) have eliminated such 
errors by, among other things, varying pres- 
entation orders? Second, Smedslund's argu- 
ments are characterized by the same weak- 
ness as Braine's—insufficient attention to the 
theory of cognitive structures. Like Braine, 
Smedslund has failed to show how his cri- 
terion fits with the theory. Instead, he has 
advanced methodological arguments (Type I 
error) when the only adequate rationale is a 
theoretical one. Hence, even if the third 
source of Type I error could not be resolved 


via stimulus refinements, the criterion would 
still be arbitrary, 


THE THEORY or COGNITIVE STRUCTURES AND 
A RESOLUTION OF THE CONTROVERSY 


Cognitive Structures Defined 


There are two sides to Piagetian structural- 
ism (e.g., Piaget, 1970b). First, there is a 
metatheoretical definition of the concept of 
Structure suitable for application to all tradi- 
tional academic disciplines. Second, there is 
a theory of psychological structure. Only the 
latter will concern us here, 

Piaget postulates two 
psychological structure 
and cognitive structure 
EL voc 

3 For the sake of completeness, it must be pointed 
out that {һе Murray-Youniss/Youniss-Murray 
studies do not prove the existence of irrelevant 
hypotheses in any strict sense. Instead, all that has 
been demonstrated is that an A=B-+C transi- 
tivity assessment paradigm is more difficult than the 


standard AABAC paradigm (Problem b above). 
While it is i i i 


principal forms of 
, perceptual structure 
- Two distinctions are 


three-term series 
ical relations. or 
“©, C. Т. Brainerd, 


childhood: Recent 
research оп Piaget's concrete-operationa] group- 


ments. Paper read at the Third Annual Conference 


on Structural Learning, Philadelphia, March-April 
1972.) 
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drawn between perceptual and cognitive 
structure. First, perceptual structures are 
said to undergo only quantitative change 
after birth, while cognitive structures are said 
to undergo qualitative change after birth. 
In short, perceptual development amounts to 
refinements in function, while cognitive de- 
velopment amounts to gross changes in archi- 
tecture. Second, perceptual structures pre- 
sumably evolve in the womb, while cognitive 
Structures presuppose internal representation 
(Piaget, 1950, 1954) and, therefore, cannot 
begin to evolve until the second year of life. 

Turning to the formal properties of cogni- 
tive structures, they are defined as internal- 
ized wholes whose laws of composition are 
mental operations апа whose self-regulatory 
rule is the principle of equilibration. The 
function of a cognitive structure is to inter- 
pret (assimilate) reality so that the cognizer 
can behave intelligently. Different levels of 
assimilation specify different levels of struc- 
ture; for example, the cognitive structures of 
the concrete-operational level (middle child- 
hood) permit only the assimilation of con- 
crete facts, while the cognitive structures of 
the formal-operational level (adolescence) 
permit the assimilation of purely conjectural 
information. Concerning the developmental 
Origins of cognitive Structures, Piaget has 
been interested in only two possibilities: Are 
the structures *logical"—in the sense of being 
engendered by all that is intelligent in human- - 
action—or are they linguistic—in the sense 
of being embodiments of the inherent struc- 
ture of man's natural languages? In contrast 
with learning-oriented accounts of cognitive 
development (e.g., Kendler, 1963), Piaget 
opts for logic rather than language and main- 
tains that cognitive structures originate in 
the consolidation of certain general properties 
of overt action (e.g., inversion, reciprocity). 
Piaget (1970b, Ch. 5) even has gone so far 
as to agree with Chomsky's (1966) claim that 
the structures of language itself are “logical,” 
Flavell (1963) indirectly provided perhaps 
the most apt comment on this matter when 
he observed that "language behavior is here 
treated as a. dependent. variable with cogni- 
tion as the independent variable Ip. 271] 
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Resolution 


Two conclusions, which entail a. resolution 
of the judgment/explanation controversy fol- 
low from the preceding account of cognitive 
structures: (а) The theory not only fails to 
justify an explanation criterion, but it makes 
such a criterion seem highly inappropriate to 
the task of determining the presence of 
cognitive structures. (b) The judgment cri- 
terion seems well-suited to the task of deter- 
mining the presence of cognitive structures. 
Conclusion a follows mainly from the postu- 
lated nonverbal (or, more generally, non- 
symbolic) nature of Piaget's structures. If, 
to paraphrase Flavell (1963), language is 
strictly a “dependent variable" vis-à-vis 
cognitive structure, then lengthy verbal ra- 
tionales of the judgments made on Piagetian 
tasks amount to suficient but not necessary 
conditions for the presence of structure. 
More simply, it follows from the theory that 
the set of all subjects evidencing adequate 
explanations on a Piagetian task is a proper 
subset of the set of all subjects who possess 
the structure which the task assesses. There- 
fore, if one makes adequate explanations a 
necessary precondition for inferring the pres- 
ence of structure, as Smedslund and some of 
the Genevans do, then one has built in a 
source of Type II error and virtually insured 
inflated age norms for the acquisition of 
specific structures. 

. Turning to Conclusion б, the suitability 
of the judgment criterion follows both from 
the assimilative function of cognitive struc- 
tures and from their logical origins. Concern- 
ing the former, the basic proof for the pres- 
ence of a given cognitive structure must, by 
definition, be a proof that an act of under- 
standing appropriate to that level of structur- 
ation has taken place. By implication, if we 


° wish to avoid being unduly conservative, then 
.We require the minimum necessary evidence 


that an act of understanding has occurred. 
Obviously, both the judgment and explana- 
tion criteria presuppose such assimilative 
acts; however, the latter is more conservative 
than the former. Since neither criterion is 
Subject to any known source of Type I 
error (by the present analysis), judgments 
are more satisfactory than explanations as a 
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criterion for the presence of cognitive struc- 
tures. Concerning the logical origins of cogni- 
tive structures, one should, in theory, opt for 
the response criterion which is the least 
restrictive regarding the domain of specific 
response systems. Again, the choice is clear: 
While judgments can incorporate many di- 
verse response modes (e.g, verbalization, 
pointing, level pulling), explanations incorpo- 
rate only the verbal mode of response. In 
short, if one chooses to employ the explana- 
tion criterion, then one unduly restricts the 
behavioral domain to which the theoretical 
construct (structure) applies. This, of course, 
amounts to another Type II error. 

Before summing up, the author should like 
to offer two further reasons, one substantive, 
the other pragmatic, for preferring the judg- 
ment criterion over the explanation criterion. 
Speaking substantively, there is the matter of 
the very significant research on cognitive 
development in deaf subjects which Furth 
(e.g., 1964, 1971) has been conducting: 
Piaget long has maintained that one reason- 
able way to approach his hypothesis that 
cognitive structures originate in logic rather 
than language is via the study of deaf chil- 
dren. Piaget's major predictions are that 
both the specific structures and their order 
of emergence will be the same in deaf and 
normal children. Obviously, опе must 
study judgment responses with the language- 
deficient deaf. Speaking pragmatically, there 
has been a notable lack of uniformity (to 
date) among the classification schemes which 
investigators have applied to explanation data 
stemming from Piagetian tasks. Unfortu- 
nately, little or no consensus exists among 
such investigators on the matter of what 
constitutes an adequate explanation. While 
there is extensive agreement that an adequate 
explanation must be preceded by a correct 
judgment, there is little agreement beyond 
this point. To illustrate this claim, consider 
two recent studies of quantity conservation 
(Problem а above), the first by Lloyd 
(1971) and the second by the writer 
(Brainerd, 1971b). Suppose that both studies 
included at least one subject who makes con- 
sistently correct conservation judgments (i.e. 
concludes that objects remain quantitatively 


invariant following perceptual deformation) 
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and who explains his judgments as follows, 
""They are equal because you could always 
change the ‘different’ one back." By Lloyd's 
scheme, based оп Greenfield’s (1966) 
work, these explanations would be labeled 
“direct-action” and viewed as conceptually 
primitive. By Brainerd’s scheme, derived 
from Piaget's concrete-operational theory, 
the same explanations would be denoted 
"inversion reversibility” and viewed as quite 
adequate. It is not necessary to pursue 
this matter further. Suffice it to say that 
. Er is some consensual agreement on 
classification Schemes, arguments for the use 
of the explanation Criterion, no matter how 
trenchant—contain a large element of wishful 
thinking, 
To summarize the Support for the judg- 
ment criterion, although judgments run the 
usual risk of extraneous Type I and Type II 


criterion and therein lies the difference 
between the two, 
CONCLUSIONS 


The author should like to advance two 
Braine’s (1959) 
earlier claim that his procedural innovations 


Piagetian theory 
appears to have been vindicated for the lim- 


re in line with the 


of Piaget’s chief 


Opponents (notably, Bruner) than with 
Piaget's own theory, 
The Second, more Important, conclusion 


Pertains to the question of what function js 
fulfilled by explanations if they are not to 
be used for inferring the absolute presence 


CHARLES J. 


BRAINERD 


of cognitive structures, Although there surely 
are others, the author would like to sug- 
gest one obvious one irom his own re- 
search (Brainerd, 1971a, 1971b; Brainerd & 
Allen, 1971b): Explanations can supple- 
ment judgments in such а way that one is 
provided with insights into the nature of the 
Structure or Structures under consideration, 
It should be recalled in this regard that, 
according to Piaget, language bears a very 
special relation to Cognitive structure. In 
particular, language is à dependent variable 
vis-à-vis cognitive Structure, Logically, this 
implies that ай identifiable structural proper- 
ties of language are, in some sense, inherent 
properties of the cognitive Structures, This, 
in turn, implies that one can discover some, 
though certainly not all, of the Properties of 
cognitive structures via the study of language, 
Obviously, those forms of language which ap- 
pear in direct connection with the function- 
ing of particular Structures (i.e., explanations 
of judgments on Piagetian tasks) constitute 
potentially the most fruitful Source of lin- 
guistic information about that structure, This 
final conclusion also is very much in line 
with the transformational grammarians’ (e.g, 
Chomsky, 1966) approach to language it- 
self and, therefore, it specifies an interface 


between Piagetian cognitive-developmental 
theory and modern developmental psycho- 
linguistics, 
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COMMENT ON MISUSE OF COVARIANCE 


D. A. SPROTT 


University of Waterloo 


In discussing my note on the analysis of co- 
variance (Sprott, 1970), Harris, Bisbee, and 
Evans (1971) note that my correction of the 
"arithmetic" in Evans and Anastasio (1968) still 
left a "significant" F ratio of 12 (instead of their 
1,789). It is curious, and perhaps symptomatic 
of the ritualistic way in which statistical meth- 
ods are often interpreted, that a numerical error 
by a factor of 100 in an F ratio is regarded 
with equanimity. The force of the example is 
apparently unchanged; because it is still “signifi- 
cant,” the example is regarded as fulfilling its 
original purpose. Equally curious is the claim 
that there is a “significant effect even when none 
Was present.” Significance is a property of the 
data; in the example it was present according 
to an objective numerical test. If no effect was 
built into the example, this merely demonstrates 
that the construction of the example was faulty 
(or that they were the victims of an event with 
probability .01). The purpose of my note was to 
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SELBY H. 


Texas Christian 


The original purpose of the illustration (Evans 
& Anastasio, 1968) was to show that analysis of 
covariance could leave a statistically significant 
residual treatment effect even though there was 
no treatment effect in the variate not predictable 
from the covariate. An F ratio is not normally 
used as an index of anything (for an index of 
residual variance, one would ordinarily use a 
partial correlation coefficient) and its magnitude 
bears on decisions of statistical significance only 
in the test of whether the F ratio exceeds or 
fails to exceed the critical value. As to our state- 
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show that there is no substitute for thought. 
Blanket condemnation can be as damaging as 
ritualistic acceptance. The analysis of covariance, 
even when treatment and covariate are corre- 
lated, can be useful and valid when in the hands 
of a scientist who knows his data, knows what 
he is doing, and is prepared to think about the 
interpretation of his analyses. 
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ment that there is a "significant effect even cd 
none was present," the context of the paragrap 
Should make it clear that the word sone refers 


to "no treatment effect in the variate not pre- 
dictable from the covariate.” 
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А COMPARISON OF 


TESTING M ULTIPLE 


TWO PROCEDU RES FOR 


CONTRASTS 


JANE PERLMUTTER AND JEROME L. MYERS! 


University of Massachusetts 


Scheffe’s test for contrasts is frequent 
is of interest. Wher 
the Bonferroni £ $ 
of critical values for the two tests. 


Given the usual multicell experimental 
design, the general null hypothesis that treat- 
ment population means are equal is, at best, 
of minimal interest. The more basic question is: 
Which means, or subsets of means, differ 
from each other. Null hypotheses relevant to 
such questions are generally expressed as linear 
contrasts of the form: 


zu. [11 
When the weights (10) are 1, =1,0,0, ««., the 
contrasts are between two treatments (pair- 
wise comparisons). Frequently, theory or data 
will suggest contrasts between two subsets of 
means. For example, the contrast might be 
between the mean of a pool of two treatment 
groups and the mean of a pool of three other 
| groups. In such a case, the weights would be 
—3, —3, 2, 2, апа 2 ог апу multiplicative 
transform of those weights by à constant. 

As the number of contrasts tested increases, 
the probability of at least one Туре I error, 
the experimentwise error rate (EW), also 
increases. There are a number of excellent 
discussions of the error rate problem, including 
reviews of procedures for controlling а 
(Games, 1971; Miller, 1966; Ryan, 1959). 
It is apparent from such reviews that à 
plethora of procedures has been proposed to 


ае 


У лози; 
زلا ز0ز ت‎ 


y= 


1 Requests for reprints should be sent to Jerome Ta 
Psychology, University of 


Myers, Department 0 

Massachusetts, Amherst, Massachusetts 01002. А 
Tables giving the Bonferroni ¢ critical factor for K 
equal to the maximum number of contrasts, the Scheffé 
critical factor, and the K where the critical factors are 
equal, as well as tables of the criterion statistic for 


various values of K for EWs of .01, .05, and .1 can be 


obtai <i uests to J 
ined by addressing reques ds supported by a 


the above address. Computations W 


Brant of free computer time {тот the ue 
Computer Center, University © Massachusetts, 
mherst, 


tistic provides a reaso 


© 


y used when a relatively small set of contrasts 
ave been planned prior to the data analysis, 
nable alternative. Comparisons are made 


aid the experimenter in his search for significant. 
differences within sets of means. Of these, 
Scheffé’s test (Schefié, 1953) seems preferable 
(сї. Myers, 1972) assuming (а) that it is 
desirable to hold EW at or below some desig- 
nated level (and we believe that it is), (0) 
that contrasts have been selected on a post 
hoc basis, and (c) complex contrasts (contrasts 
assigning nonzero vj to more than two treat- 
ments) are of interest. Scheffé’s test controls 
EW for the infinitely large set of possible con- 
trasts, does not require equal group sizes, is 
relatively robust against nonnormality and 
heterogeneity of variance, and is based on the 
F statistic, thus using readily available tables. 
Where interest centers entirely on pairwise 
comparisons, the Newman-Keuls procedure 
provides an honest experimentwise error rate 
with more power than the Scheflé procedure. 
Since the Scheffé test ensures a constant 
EW, independent of the number of significance 
tests carried out, it is an excellent tool for a 
post hoc search for significant contrasts. How- 
ever, its strength—that it sets EW for the 
entire infinitely large set of possible contrasts— 
is also its weakness. The experimenter pays & 
price in power for the privilege of testing all 
possible contrasts at à designated EW level. 
Dunn (1961) and Davis (1969) have sug- 
gested an alternative to the strategy of select- 
ing contrasts post hoc and testing by the 
Scheflé procedure. They note that an investi- 
gator who specifies even a large number of 
contrasts in advance of the data collection and 
uses the Bonferroni £ test for controlling EW 
for that designated finite set, may well have 
more power than if he had used the Scheffé 
test. The present investigators have replicated 
Dunn’s work and extended it to the .10 level. 
The purpose of the present paper is to sum- 
marize the results of comparing the power of 
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Bonferroni / and Scheflé methods and to note 
the availability of tables summarizing the 
power comparisons and providing criterion 
statistics tor EWs of .01, .05, and .10. The 
extension of Dunn’s work to the .10 level 
should prove useful if we accept Schefié’s 
(1959) recommendation that this level be used 
in testing contrasts. If such a recommendation 
appears heretical, consider that EW is con- 
ceptually very different from the usual a level 
for single tests and that the use of a level 
higher than the conventional a level provides 
increased power while still ensuring that 
individual tests are carried out with extremely 
low probabilities of Type I errors. 

A simple index of the relative power of the 
two techniques is the ratio of their confidence 
interval widths. The Bonferroni / interval is 
dependent upon K, the number of contrasts, 
while the Scheffé interval is not. Confidence 
interval widths were computed for the Bon- 
ferroni / for maximum А (to be defined below) 
and compared with the Schefíé interval for 
the same error degree of freedom and EW. In 
addition, values of A were computed for which 
the intervals were equal in width. 


Two CONFIDENCE INTERVALS 


The criterion statistic for the Scheffé test 
is a — 1 (where a is the total number of 
groups) times the / required for significance 
with a — 1 and error degrees of freedom, and 
probability less than or equal to FW. Thus, 
the Scheffé confidence interval width equals: 


(2): (WM Scrror) * (УЎлоу/п)) 


‘(N (a — 1)Fuw,a — 


ldfew). [2] 


The Bonferroni ¢ method is used when the 
set of contrasts is planned prior to the collec- 
tion of data. The EW is selected for the set of 
contrasts, and partitioned among them. That 
is, typically an alpha level of EW/K is used 
for each contrast; however, the experimenter 
has the option of partitioning the EW in any 
way he chooses, and thus can assign a greater 
portion of the EW and significantly greater 
power to the contrasts of most interest. This 
test will also usually be conservative, but as 
will be pointed out, less so than for the Schefié 
procedure for a wide range of K values. The 
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reason for this conservatism is that this method 
is based on the Bonferroni inequality which 
essentially states that: 


EW < Жа, [3] 
When the contrasts are orthogonal the equality 
will hold approximately but when the contrasts 
are overlapping the test will be conservative. 
The criterion statistic is a two-tailed / test or a 
single degree of freedom F test, апа the 
confidence interval width equals: 


Q)- (NM Sess): (лау) 
. Few Ку 1 fervor). Т 


For purposes of comparing the power of the 
two methods the terms common to both 
intervals can. be dropped and the "critical 
factors" are thus Va = 1)F rw l) 
ХЁкву ук, 1, dfer for the Schefié and Bon- 
ferroni { tests, respectively. It is obvious that 
the Bonferroni ! critical factor is dependent on 
K while the Scheffé critical factor is not. 

Although an infinite number of sets of 
weights (and thus contrasts) can be associated 
with any group of means, a “maximum” 
number of contrasts was defined. This in- 
cluded comparisons between means of in- 
dividual treatment groups or between means 


TABLE 1 
Maximum NUMBERS OF CONTRASTS 


Number of means Maximum number of contrasts 
3 6 
4 25 
5 90 
6 301 
7 906 
8 3,025 
9 9,330 

10 28,501 
11 86,526 
12 261,625 
13 788,970 
14 2,375,101 
15 7,141,686 
16 21,457,825 
17 64,439,010 
18 193,448,101 
19 580,606,446 
20 1,742,343,625 


" m i was 
Note. See text for description of how maximum number © 


arrived at. 
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of two sets of groups. Table 1 gives the maxi- 
mum number of contrasts under this definition 
for from 3 to 20 groups. This number increases 
rapidly as the number of groups increases, but 
few experimenters would have theoretical 
reasons to be interested in nearly all of them. 
In fact, many investigators are only interested 


H " а А а : 
In pairwise comparisons (( )) which аге 
2 


considerably fewer. 

To obtain critical factors for the two test 
statistics, a computer program was written 
to calculate the probability of any F given 
specified numerator and denominator degrees 
of freedom. Simpson's rule was used to evaluate 
the appropriate incomplete beta integral 
(Pearson, 1934) and a search routine was used 
to calculate the appropriate criterion F for 
any probability level within one-hundredth of 
the desired probability. 

Comparison of the Bonferroni ¢ critical 
factor for K equal to the maximum number of 
contrasts, as well as the Scheffé critical factor 
for EWs of .1, .05, and .01 indicates that the 
Scheffé value is always smaller than the 
Bonferroni ¢ value for the maximum number of 
contrasts, and hence more powerful for this A 
value, However, as was pointed out, few 
experimenters would ever consider looking at 
so many contrasts, and thus it is of interest 
lo see at what value of K the critical factors 
are equal. 

The value of А where the Bonferroni / 
critical factor is equal to the Schefié critical 
factor increases as a and error degrees of 
freedom increase, and as EW decreases, but is 
generally a rather large number (except 
perhaps for very small a where there are few 
possible contrasts). It therefore seems that 
power can be gained by planning even a 
liberally large number of contrasts of theo- 
retical interest, and using the Bonferroni ¢ 
method. Even a seemingly large number of 
planned contrasts is often fewer than the 
crossover К. However, even though the cross- 
over K is often rather large, with substantial 
error degrees of freedom the interval widths for 
the two methods may be close enough to 
warrant buying the option of “data snooping” 
for a slight loss in power. Furthermore, if an 
experimenter plans a given number of con- 
trasts, and uses the Bonferroni ¢ method, but 


discovers after "data snooping” additional 
contrasts of interest, he may then use the 
Scheffé method to test the additional contrasts, 
with the consequence that his resulting EW 
is the sum of the EWs used for the two 
methods. Alternatively, he can abandon the 
Bonferroni ¢ approach and use the Scheffé 
method for both the a priori and a posteriori 
contrasts, with a loss of power for the a priori 
contrasts. 

Due to the recommendation to use the 
Bonferroni / method, and the lack of available 
tables for many significance levels, the follow- 
ing formula can be used to calculate the 
criterion / statistic using readily available two- 
tailed Z tables: 


LEW/K, dferzor 
Zew + Zew m. 
tee. < E 


The Newman-Keuls method for pairwise 
contrasts is also more powerful than the 
techniques which control error rate for the 
infinite set of possible contrasts (Schefié & 
Tukey's honestly significant difference). This 
is so because successively less conservative 
tests are done for less divergent means. Un- 
fortunately, no direct comparison can be 
made of the power of the Newman-Keuls and 
Bonferroni / methods, because the confidence 
interval becomes narrower for successive con- 
trasts with the Newman-Keuls test. 

In conclusion, it is suggested that the 
experimenter consider the power of multiple- 
comparison techniques prior to collecting data, 
It appears that the Bonferroni / or Newman- 
Keuls methods will often prove more powerful 
than other methods which control EW. 

In addition, it is worth noting that the 
Bonferroni approach is not restricted to ¢ and 
F tests. That is, the Bonferroni inequality 
(EW < Xaj also holds ior other statistical 
tests such as chi-square and tests of correlation 
coefficients, and thus can be applied to c 
error rate in a variety of situations, 


= Zew; 


ontrol 
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Bonferroni 4 and Schellé methods and to note 
the availability of tables summarizing the 
power comparisons and providing criterion 
statistics for EWs of .01, .05, and .10. The 
extension of Dunn's work to the .10 level 
should prove useful if we accept Schelfé’s 
(1959) recommendation that this level be used 
in testing contrasts. If such a recommendation 
appears heretical, consider that EW 15 con- 
ceptually very different from the usual æ level 
for single tests and that the use of a level 
higher than the conventional a level provides 
increased power while still ensuring that 
individual tests are carried out with extremely 
low probabilities of Type I errors. 

A simple index of the relative power of the 
two techniques is the ratio of their confidence 
interval widths. The Bonferroni ¢ interval is 
dependent upon К, the number of contrasts, 
while the Schefié interval is not. Confidence 
interval widths were computed for the Bon- 
ferroni / for maximum А (to be defined below) 
and compared with the Schefié interval for 
the same error degree of freedom and EW. In 
addition, values of K were computed for which 
the intervals were equal in width. 


Two CONFIDENCE INTERVALS 


The criterion statistic for the Scheffé test 
is а= 1 (where a is the total number of 
groups) times the F required for significance 
with a — 1 and error degrees of freedom, and 
probability less than or equal to EW. Thus, 
the Schefié confidence interval width equals: 


(2). (NM Serror) * (МУлсу/п)) 
„(Х(а — 1)Ёьх,а — 1dferror)- [2] 
The Bonferroni / method is used when the 
set of contrasts is planned prior to the collec- 
tion of data. The EW is selected for the set of 
contrasts, and partitioned among them. That 
is, typically an alpha level of EW/K is used 
for each contrast; however, the experimenter 
has the option of partitioning the EW in any 
way he chooses, and thus can assign a greater 
portion of the EW and significantly greater 
power to the contrasts of most interest. This 
test will also usually be conservative, but as 
will be pointed out, less so than for the Scheffé 
procedure for a wide range of K values. The 
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rcason for this conservatism is that this method 
is based on the Bonferroni inequality which 
essentially states that: 


EW X Xa. [3] 


When the contrasts are orthogonal the equality 
will hold approximately but when the contre 
are overlapping the test will be conservative. 
The criterion statistic is a two-tailed / test or a 
single degree of freedom F test, and the 
confidence interval width equals: 


(2): (VMS error) (VS) 
È NFR й 


For purposes of comparing the power of 
two methods the terms common to both 
intervals can be dropped and the “critical 
factors” are thus (а — 1) уу, 1, after and 
NFgw ку 1, Фе for the Scheffé and Bon- 
ferroni { tests, respectively. It is obvious that 
the Bonferroni 2 critical factor is dependent on 
K while the Scheffé critical factor is not. 

Although an infinite number of sets of 
weights (and thus contrasts) male associated 
with any group of means, a “maximum” 
number of contrasts was defined. This in- 
cluded comparisons between means of in- 
dividual treatment groups or between means 


TABLE 1 


Maximum NUMBERS OF CONTRASTS 


Number of means Maximum number of contrasts 
3 6 
4 25 
5 90 
6 301 
7 966 
8 3,025 
9 9,330 

10 28,501 
1 86,526 
12 261,625 
13 788,970 
14 2,375,101 
15 7,141,686 
16 21,457,825 
17 64,439,010 
18 193,448,101 
19 580,606,446 
20 1,742,343,625 


7 $31 8 er WAS 
Nole. Sce text for description of how maximum number v 
arrived at. 
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of two sets of groups. Table 1 gives the maxi- 
mum number of contrasts under this definition 
for from 3 to 20 groups. This number increases 
rapidly’ as the number of groups increases, but 
few experimenters would have theoretical 
reasons to be interested in nearly all of them. 
In fact, many investigators are onlv interested 


" Кы ый : а P 
In pairwise comparisons (6)) which are 


considerably fewe 

То obtain critical factors for the two test 
stat 5, а computer program was written 
to calculate the probability of any F given 
specified numerator and denominator degrees 
of freedom. Simpson’s rule was used to evaluate 
the appropriate incomplete beta integral 
(Pearson, 1934) and a search routine was used 
lo calculate the appropriate criterion F for 
any probability level within one-hundredth of 
the desired probability. 

Comparison of the Bonferroni ¢ critical 
factor for K equal to the maximum number of 
contrasts, as well as the Scheffé critical factor 
for EWs of .1, .05, and .01 indicates that the 
Scehefíé value is always smaller than the 
Bonferroni / value for the maximum number of 
contrasts, and hence more powerful for this A 
value. However, as was pointed out, few 
experimenters would ever consider looking at 
so many contrasts, and thus it is of interest 
to see at what value of K the critical factors 
are equal. . 

The value of A where the Bonferroni / 
critical factor is equal to the Scheffé critical 
factor increases as a and error degrees of 
freedom increase, and as EW decreases, but is 
generally a rather large number (except 
perhaps for very small а where there are few 
possible contrasts). It therefore seems that 
power can be gained by planning even a 
liberally large number of contrasts of theo- 
retical interest, and using the Bonferroni ¢ 
method. Even a seemingly large number of 
Dlanned contrasts is often fewer than the 
crossover А. However, even though the cross- 
Over K is often rather large, with substantial 
error degrees of freedom the interval widths for 
the two methods may be close enough to 
warrant buying the option of “data snooping” 
for a slight loss in power. Furthermore, if an 
Experimenter plans a given number of con- 
trasts, and uses the Bonferroni / method, but 


discovers after “data snooping” additional 
contrasts of interest, he may then use the 
Schefié method to test the additional contrasts, 
with the consequence that his resulting EW 
is the sum of the EWs used for the two 
methods. Alternatively, he can abandon the 
Bonferroni / approach and use the Scheffé 
method for both the a priori and a posteriori 
contrasts, with a loss of power for the a priori 
contrasts. 

Due to the recommendation to use the 
Bonferroni ¢ method, and the lack of available 
tables for many significance levels, the follow- 
ing formula can be used to calculate the 
criterion { statistic using readily available two- 
tailed Z tables: 

Iw i, f error 
= куук + Hew ix + Zew a ZEVIK, [5] 
F (dferror — 2) 


The Newman-Keuls method for pairwise 
contrasts is also more powerful than the 
techniques which control error rate for the 
infinite set of possible contrasts (Scheffé & 
Tukey’s honestly significant difference). This 
is so because successively less conservative 
tests are done for less divergent means. Un- 
fortunately, no direct comparison can be 
made of the power of the Newman-Keuls and 
Bonferroni ¢ methods, because the confidence 
interval becomes narrower for successive con- 
trasts with the Newman-Keuls test. 

In conclusion, it is suggested that the 
experimenter consider the power of multiple- 
comparison techniques prior to collecting data. 
It appears that the Bonferroni ¢ or Newman- 
Keuls methods will often prove more powerful 
than other methods which control EW. 

In addition, it is worth noting that the 
Bonferroni approach is not restricted to 1 and 
F tests. That is, the Bonferroni inequality 
(EW < Хо;) also holds ior other statistical 
tests such as chi-square and tests of correlation 
coefficients, and thus can be applied to control 
error rate in a variety of situations. 
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SELF- AND INTERPERSONAL EVALUATIONS: 
ESTEEM THEORIES VERSUS CONSISTENCY THEORIES 


STEPHEN C. JONES? 


Cornell University 


Two social psychological theories are contrasted and different predictions are 
derived concerning the relation between self and interpersonal evaluations. 
The prediction from self-esteem theory is that the higher an individual's 
evaluation of himself, the less his tendency to reciprocate evaluations from 
others, whereas the prediction from selí-consistency theory is that the higher 
the individual’s evaluation of himself, the greater his tendency to reciprocate 
evaluations from others. Studies bearing on these predictions are reviewed, 
and the evidence tends to support self-esteem theory. Furthermore, two exten- 
sions of the assumptions of self-esteem theory are presented and discussed 
in terms of accounting for results which apparently support self-consistency 
theory, Finally, whether or not the experimental subject is the direct target 
of another person's actions appears to distinguish esteem results from con- 
sistency results. The implications of this distinction for other problems of 
person perception and social evaluation are discussed. A general conclusion 
is that cognitive consistency theories may be somewhat overworked as 
explanatory frameworks for the study of social evaluations. 


Although the interpersonal events involving 
the evaluations an individual receives from 
other people occur experimentally and in the 


grated approaches which go beyond a few 
intuitively appealing variables, or attempts 
to force a confrontation among various ap- 


“real world” with some regularity, confusions 
arise in attempting to explain these regu- 
larities, We observe that people typically talk 
more (e.g., Bavelas, Hastorf, Gross, & Kite, 
1965; Jones, 1968b) appraise themselves 
more highly (e.g., Videbeck, 1960), and like 
others better (e.g. Aronson & Worchell, 
1966; Backman & Secord, 1959; Jones & 
Panitch, 1971) when they receive approval 
from others than when they receive disap- 
proval, but we know surprisingly little 
about why these things happen. There are 
wide differences of opinion about what under- 
lying motivational processes affect a person's 
reactions to the knowledge that he is liked 
or disliked, that his opinions are valued or 
worthless, that he is accepted as a co-worker 
or rejected as a roommate, or in other ways 
praised or derogated by others. 

The bulk of the investigations relating to 
this motivational issue unfortunately have 
been couched in terms of “mini theories" 
(Aronson, 1969), and attempts at more inte- 


1 Requests for reprints should be sent to 
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proaches have been all too few and far 
between. The purpose of this paper is to 
focus attention on two broad traditions in 
social psychology which relate to problems of 
interpersonal evaluations—self-esteem theo- 
ries and cognitive consistency theories. Using 
as a focal point the question of how an indi- 
vidual reacts to evaluations received from 
others as a function of his self-evaluation, 
this paper discusses the distinctions between 
these theoretical positions, evaluates the evi- 
dence bearing on these distinctions, and 


suggests possible directions for resolving the 
differences. 


‘THEORIES 
“I am good/You love me/therefore you are 
good. . . . /I am bad/You love me/there- 


fore you are bad.” Does Laing’s [1970, p. 10] 
poem of a schizophrenic relation represent a 
typical, self-consistent pattern of interper- 
sonal evaluations? Or, if I think I am bad 
and discover that you love me, am I pleased 
as punch to get a little affection along the 
way? This “knot” of Laing's strikes at the 


heart of the controversy between esteem 
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TABLE 1 


PREDICTED FAVORABILITY OF RATINGS OF OTHERS AS 
A FUNCTION OF SELF-EVALUATION AND 
APPRAISALS RECEIVED FROM OTHERS 


Appraisals from others 


Self- Self-consistency Self-esteem 
evaluation theory theory 
Positive | Negative Positive Negative 
de | 
High | g = + 
То "e + 


++ adici 


and consistency approaches to the relation 
between self and interpersonal evaluations, 


Self-Consistency Theories 


The central notion of self-consistency theo- 
ists is that an individual's actions, attitudes, 
and his receptivity to information from other 
people are strongly affected by a tendency to 
create and maintain a consistent cognitive 
state with respect to his evaluations of him- 
Self, Various reasons are given for this cogni- 
tive tendency including economy in the or- 
ganization of one's perceptions ( Heider, 
1958), the reduction of dissonance (Festinger, 
1957), predictability in relationships with 
others (Newcomb, 1961; Secord & Backman, 
1965), or avoiding cognitions with conflicting 
implications for action (Jones & Gerard, 
1967). 

As elaborated by Secord and Backman 
(1961, 1964, 1965), a state of self-consistency 
Or congruency is said to exist “when [his 


duce the inconsistency, 


Backman (1961) propose that inconsistencies 
may be eliminated by s i 
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other people, and they predict that the indi- 
vidual “tends to increase his liking for 
[others] who behave toward him in a con- 
gruent fashion, and to decrease his liking for 
those who behave in an incongruent manner 
[p. 25]." 

The self-consistency notion has played a 
role in other theoretical approaches to inter- 
personal relationships such as Heider's 
(1958) balance theory, Newcomb's (1961) 
symmetry model, and Baron's (1966) social 
reinforcement theory, It also has been central 
to the thinking of humanistically oriented 
self-concept theorists such as Lecky (1945) 
and Rogers (1951, 1959), Although these 
positions contain many differences in their 
foci and in their implications for action in- 
volving self and others, the common charac- 
teristic which concerns us is the proposal that 
relations between evaluations of the self and 
others are mediated by a tendency toward 
self-consistency. 

To clarify this point and anticipate its 
distinction from the self-esteem theories, con- 
sider an experimental paradigm in which sub- 
jects have either high self-evaluations or low 
self-evaluations and receive either positive 
evaluations or negative evaluations from other 
members of their group. The question is to 
what extent the subjects accept or reject their 
peers. For individuals with high evaluations 
of themselves or some aspect of self, positive 
evaluations from others are consistent. and 
negative evaluations are inconsistent, whereas 
for individuals with low self-evaluations, 


positive evaluations are inconsistent and 
negative evaluations are consistent, There- 


fore, the prediction from the self-consistency 
theories is that high self-evaluators will react 
more favorably to approval than to disap- 
proval and that low self-evaluators will 
react more favorably to disapproval than to 


approval. This prediction is described in 
Table 1. 


Self-Esteem T, heorics 


The self-esteem position assumes that the 
individual 


satisfaction, 
Although this need 
at any given point 
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in time it may manifest itself with respect to 
a particular aspect of one's selí-evaluation 
rather than to more global feelings about 
the self. Furthermore, the state of the need 
varies with the degree of personal satisfac- 
tion or frustration the individual experiences 
in a particular situation or period of time. 
The self-esteem need also varies across indi- 
viduals. It is assumed that this individual 
difference variation is reflected in attitudinal 
measures of self-esteem and that persons with 
high self-esteem are relatively more satisfied 
with respect to this need than persons with 
low self-esteem. 

As was the case with the self-consistency 
theories, the self-esteem need is responsive 
{о evaluative information the individual gains 
from his own behavior and comparative or 
reflected appraisals from other people. Coping 
effectively with the tasks and problems one 
encounters in his physical and social environ- 
ment as well as gaining information from 
others that he is liked and respected or that 
his actions or characteristics are highly 
evaluated produce satisfactions for his selí- 
esteem need, Data from a variety of social 
psychological studies, including, for example, 
studies of social influence (Hovland & Janis, 
1959), self-presentation (Jones, 1964) and 
social comparison (Morse & Gergen, 1970), 
have been interpreted in terms of this direct 
relation between self-esteem needs and social 
acceptance or rejection. 

The critical question for purposes of com- 
parison with self-consistency theories is what 
predictions self-esteem theories make regard- 
ing the individual's reactions to positive or 
negative evaluations received from. other 
people. In concert with the self-consistency 
theories, the major sources of change in 
thought or action involve the individual's 
evaluation of himself, his choice of activities 
or social roles, and his relationships with 
others. The difference is that these changes 
are designed to enhance self-esteem rather 
than to achieve self-consistency. 

In general, one would expect people to 
respond favorably (i.e., like, benefit, agree 
with, etc.) to positive evaluations of them- 
selves which are assumed to satisfy esteem 
needs and to respond unfavorably (i.e., dis- 
like, aggress against, disagree with, etc.) to 


negative evaluations of themselves which are 
assumed to frustrate esteem needs. This pre- 
diction follows from the social exchange no- 
tions of Homans (1961) and Thibaut and 
Kelley (1959), the notion that attraction to 
a group is directly proportional to the needs 
satisfied by the group (Cartwright & Zander, 
1960; Dittes, 1959), as well as the frustra- 
tion-aggression hypothesis (Dollard, Doob, 
Miller, Mowrer, & Sears, 1939), Further- 
more, this tendency should interact with the 
magnitude of the esteem needs as experimen- 
tally manipulated or measured. Specifically, 
low self-esteem individuals are predicted to 
respond more favorably to positive evalua- 
tions from others and more unfavorably to 
negative evaluations from others as compared 
to high self-esteem individuals. This predic- 
tion, first articulated by Dittes (1959), fol- 
lows from the assumption that low self- 
esteem people have greater needs for esteem 
enhancement and are therefore more satisfied 
by the approval of others and more frustrated 
by the disapproval of others than are high 
self-esteem people. The prediction is described 
in Table 1. 


EMPIRICAL EVIDENCE 


The critical test between the self-esteem 
and self-consistency theories involves manipu- 
lating or measuring the individual's evalua- 
tion of himself and the evaluations he receives 
from other people, then measuring the feelings 
he expresses to or about his associates, their 
characteristics or actions. At the present time 
there are a substantial number of studies 
which meet in whole or in part the conditions 
necessary for this critical test. 


Self-Consistency Findings 


Consider, first, the research which gen- 
erally is viewed as supporting the self- 
consistency position. Three of the six studies 
are correlational in design, and they differ 
from other investigations described in that all 
the evaluative components measured are re- 
ports made by the subject. In particular, the 
evaluations purportedly received by the sub- 
ject from others are estimates as to how he 
thinks others are evaluating him. 

In a study of fourth-, fifth-, and sixth- 
grade school children of both Sexes, Wiest 
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(1965) measured global self-esteem and cor- 
related these measures with indices of the 
degree of congruency existing between a stu- 
dent's attraction. for each other member of 
his group and his perception of others’ feel- 
ing toward himself. Wiest obtained a signifi- 
cant positive correlation (.23) which he had 
predicted from Heider’s balance theory. As 
self-esteem increased, interpersonal congru- 
ency also increased. Backman and Secord 
(1962) asked each member of a college soror- 
ity to select from a list of 16 adjective pairs 
the 5 adjectives most characteristic of her- 
self, to indicate which 5 adjectives each 
other sorority member would attribute to 
her, and to rank order the other members 
according to their attractiveness. Interper- 
sonal congruency was measured by the de- 
gree of overlap between the adjectives a 
subject selected for herself and those she 
estimated would be attributed to herself by 
others. In support of the Secord and Back- 
man (1961) model, more congruency was ob- 
tained for the subject’s ratings of the sorority 
members most liked by the subject than for 
the members least liked by the subject. From 
results obtained in a longer range investiga- 
tion, Newcomb (1956) reported that inter- 
personal attraction was closely related to the 
perceived agreement between a person’s self- 
description and his estimate of how other 
members of the dormitory would describe him. 

In experimental studies as contrasted with 
correlational studies of the self-consistency 
theory, evaluations from others (which were 
manipulated) actually were received rather 
than estimated by the subject. In one of the 
three studies which partially fulfill the de- 
sign conditions described above, Wilson 
(1965) led subjects to believe that they had 
failed on a task, Then either by having sub- 
jects make their own decisions (personal 
decision condition) or by having these deci- 
sions made for them by a “chance” procedure 
(chance decision condition), he brought 
about the decision that Subjects would not 
proceed with a second task. After this deci- 
Sion, subjects received a bogus comment sup- 
posedly written by their partners, which in 
half of the conditions described a subject as 
competent and implied that he should accept 
the second task, and in half of the conditions 
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described him as incompetent and implied 
that he should not accept the task. Wilson 
found that of the subjects in the personal 
decision conditions, those receiving a negative 
note rated their partners as more accurate 
and slightly more attractive than those Ж 
ceiving a positive note, For subjects in th 
chance decision condition, those receiving a 
positive note rated their partners as more 
emotionally supportive, more attractive, and 
slightly more accurate than those receiving 
the negative note. , 

Although this study is often cited as sup- 
port for the self-consistency position (e.g. 
Aronson & Mettee, 1968; Secord & Backman, 
1965), there is the question of why chance 
decision subjects who had also failed the first 
task and presumably held negative evalua- 
tions of their own abilities actually preferred 
partners who evaluated them positively but 
inconsistently. What most clearly distin- 
guishes the chance from personal decision 
conditions is not a difference in level of self- 
evaluation but whether or not the subject 
made a personal decision not to continue. 
Therefore, although self-evaluative processes 
may be involved in the study, the results can 
be interpreted quite easily by assuming that 
individuals tend to like better and view as 
more accurate others who agree with their 
decisions as compared with those who disagree 
(cf. Festinger, 1954, 1957). 

The most widely cited study of self- 
consistency and interpersonal evaluations is 
Deutsch and Solomon’s (1959) test of deriva- 
tions from Heider’s (1958) balance theory. 
In their 2X 2 design, self-evaluation was 
manipulated by informing subjects that they 
had either succeeded or failed on each of 
two tasks, and evaluations from others were 
manipulated by having each subject re- 
ceive a note from one of her teammates 
stating either (e.g.) “You are the person I 
most prefer to have on my team again” 
or (e.g.) “You are the person I least 
prefer to have on the same team with me 
again." Analyses of evaluative ratings each 
subject made of her note-sending teammate 
showed, first, a “positivity” effect, that is, a 
tendency for subjects to rate more favorably 
the positive note sender than the negative 
note sender, and second, a self-consistency 
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effect, that is, an interaction between selí- 
evaluation and evaluations received from 
others in which the tendency for such a 
positivity effect was obtained only for sub- 
jects in the high-self-evaluation treatment. 
Low-self-evaluators tended to rate the 
negative note sender slightly, though not 
significantly, more favorably. 

This experiment has not gone uncriticized. 
Pepitone (1966) reinterpreted their data by 
arguing that subjects may have been attempt- 
ing to fulfill the announced purpose of the 
experiment by demonstrating their skill at 
evaluating their teammates. An inconsistent 
evaluator would be perceived as inaccurate 
and, in this context, be less attractive than 
consistent evaluators. A more serious problem 
for the Deutsch and Solomon study is 
Skolnick's (1971) failure to replicate their 
results even though he used an almost identi- 
cal procedure. In fact, Skolnick found support 
for the prediction from self-esteem theory; 
low self-evaluators rated the positive note 
sender more favorably and the negative 
note sender less favorably than did high 
self-evaluators. 

A third experiment tending to support the 
self-consistency prediction was reported by 
Dutton and Arrowood (1971). Subjects were 
led to believe that they had performed either 
well or poorly in presenting arguments about 
draft resistance to a panel of peers who, 
independently of, and crosscutting the per- 
lormance manipulation, ostensibly judged the 
Subjects’ performances as either a “good job” 
or a “poor job.” The results showed that 
subjects who thought they had done well were 
more attracted to the peers who made positive 
evaluations than to those who made negative 
evaluations. Subjects who thought they had 
done poorly were more attracted to peers 
Who made negative evaluations. 

Although this study appears to support 
consistency theory, there is reason to ques- 
tion whether or mot the good and poor per- 
formance conditions represent a self-evalua- 
tion manipulation. The task that subjects 
Performed consisted of reading arguments 
Prepared by the experimenter, not the sub- 
ject. In the poor-performance condition the 
Subject was given an objectively confusing 
task, then informed by a physiological mea- 
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sure that she felt “extremely anxious,” a con- 
dition which presumably arose from the “cog- 
nition that she was performing poorly.” In 
the good performance condition, the task was 
objectively clear and the physiological mea- 
sure indicated no anxiety. Thus, the subject 
merely read an argument, which was either 
clear or confusing and which the experimenter 
created, and subsequently her peers either 
agreed or disagreed with what her body pre- 
sumably was telling her. Under these circum- 
stances it is hard to see how the subject 
felt any personal responsibility for either 
her performance or the consequences of her 
performance. 


Self-Esteem Findings 


Both correlational and experimental data 
have supported self-esteem theory. 

In an extensive field investigation of the 
correlates of self-esteem, Rosenberg (1965) 
asked his sample of high school juniors and 
seniors, “How much does it bother you to find 
that someone has a poor opinion of you?” 
Since Rosenberg correlated answers to this 
question with a range of self-esteem scores, 
this aspect of his study partially fulfills the 
design criteria for this review. In effect, high 
and low self-evaluators react to imagined 
negative evaluations from others. In support 
of the prediction from self-esteem theory, 
Rosenberg found that the lower a person's 
self-esteem, the more he was bothered by the 
poor opinion of another person. 

The earliest experimental study of the rela- 
tion between self-esteem and interpersonal at- 
traction based on the self-esteem formulation 
was conducted by Dittes (1959). Self-esteem 
was measured by various self-reports made 
by the subjects and by the actual evaluations 
made by other members of their groups. At 
the end of a group discussion, the members 
were permitted to see experimentally ma- 
nipulated peer ratings of their acceptability 
to the group. Dittes found that acceptance 
by the group interacted significantly with 
self-esteem. For subjects who learned that 
they were not acceptable to the group, the 
lower their self-esteem, the less attracted 
they were to the group. However, for sub- 
jects learning that they were acceptable to 
the group, there was a slight tendency for 
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those with lower self-esteem to be attracted 
more to the group. A similar interaction ef- 
fect was obtained by Jones, Knurek, and 
Regan (in press). In their experiment selí- 
esteem was measured with the Janis-Field 
scale (Hovland & Janis, 1959). Following a 
half-hour period for becoming acquainted 
with another person, subjects learned that 
their partners either liked them or disliked 
them. On both a “desire to change partner” 
measure an a “general emotional reaction" 
measure low self-esteem subjects indicated 
more attraction to their partners in the like 
condition and slightly less attraction to their 
partners in the dislike condition than did 
high-self-esteem subjects. 

Walster and her colleagues have obtained 
similar results from experiments in which self- 
esteem was manipulated rather than mea- 
sured. They provided subjects with false 
feedback from personality tests indicating 
either that they were mature, strong, compe- 
tent, socially responsible, etc. (high-self- 
esteem conditions) or immature, inflexible, 
and possessing other negative characteristics 
(low-self-esteem conditions). In the first 
study, Walster (1965) found that female sub- 
jects in the low-self-esteem condition were 
more attracted to a male suitor than were 
subjects in the high-self-esteem condition, In 
the second study (Jacobs, Berscheid, & 
Walster, 1971), the dating skills of a male 
subject were either praised and he was ac- 
cepted as a potential date by a same-aged 
female, or his dating skills were derogated 
and he was rejected as a potential date, Once 
again, males in the low-self-esteem condition 
liked the female evaluator more in the ac- 
cepting condition but also disliked her more 
in the rejecting condition than did males in 
the high-self-esteem condition. 

Results Supporting the self-esteem theory 
have been obtained not only for measures 
and manipulations of global self-esteem but 
also for more specific aspects of the self. In 
addition to Skolnick’s Study (1971) already 
Cited, Jones (1966) manipulated subjects? 
evaluations of their design judgment ability 
and subsequently had groups of subjects ex- 
change evaluations of performances relating 
to this ability, Half of the subjects received 
mostly positive evaluations from the group, 
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and the remainder received mostly negative 
evaluations, The evaluations subjects made of 
their own and other members’ performances 
constituted the main dependent variables. 
Although the pattern of data indicating sub- 
jects’ ratings of their peers’ performances 
were in the direction predicted by self-esteem 
theory, the interaction between manipulated 
self-evaluation and evaluations received from 
others was not statistically significant. How- 
ever, Jones correlated the subjects’ evalua- 
tions of their own performance with those 
made of other members’ performances and 
found results supporting the self-esteem 
theory. Specifically, in the positive evaluation 
condition, the more positively subjects rated 
their own performances, the more negatively 
they rated the responses of others, but in the 
negative evaluation condition, the correlation 
between ratings of self and others were posi- 
tive. Although this latter correlation was 
not significant, it differed significantly from 
the corresponding correlation in the positive 
evaluation condition. 

Regarding these correlational findings, it 
is interesting to note that Harvey (1962) 
obtained comparable results in a study which - 
generally is viewed as support for the self- 
consistency theory. His main finding was that 
the more negative the evaluations a subject 
received from another person, the less favor- 
able were his attitudes toward the other 
person. These results can be explained by 
either self-consistency or self-esteem theory. 
However, in an internal correlational analy- 
sis, Harvey found that the more a subject 
changed his evaluation of himself in a nega- 
tive direction, the more he also changed his 
evaluation of the other person in a negative 
direction, Since the experimentally controlled 
evaluations subjects received from the other 
person were in four out of five conditions 
negatively discrepant from his self ratings, 
this positive correlation contradicts the self- 
consistency theory. This theory would predict 
that as the self-evaluation changes in the 
negative direction advocated by the other 
person, the attraction to this other person will 
increase, 

Secord and Backman’s (1961) version of 
self-consistency theory views the self-evalua- 
tion as more flexible and subject to change 
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than most other versions of the theory. 
Consequently, studies apparently supporting 
self-esteem theory but measuring only the 
evaluations a subject makes of others and 
not changes in his self-evaluation must be 
interpreted with some caution. In this light 
the findings of Jones and of Harvey, which 
had evaluations of both self and others as 
dependent variables, become particularly im- 
portant in a comparative evaluation of self- 
esteem and self-consistency theories. 

One further study relevant in this issue of 
the correlations between evaluations of self 
and others was conducted by Jones, Hester, 
Farina, and Davis (1959). Self-evaluation 
was measured indirectly by asking subjects 
to indicate how they expected to be rated by 
two observers on such dimensions as selí- 
centeredness, popularity, tolerance, generos- 
ity, and self-confidence. The data reported 
by Jones and his colleagues concerned sub- 
jects’ evaluations of the likability of an ob- 
server who evaluated them in a highly deroga- 
tory fashion on the same dimensions for 
Which subjects had predicted the observers' 
reactions, The characteristics of the dero- 
gator were manipulated. In one condition 
the derogator was portrayed as unstable and 
maladjusted and in the other condition as 
well adjusted and insightful. 

Before viewing the data, consider the pre- 
dictions deriving from the two theories. As 
noted above, when the evaluations received 
from the other person are negative, as they 
were in the present study, the self-consistency 
theory prediction is that the lower the self- 
evaluation (in this case the expected ratings 
of others), the more positive the evaluation 
of others (ie., a negative correlation). The 
Self-esteem prediction is that the lower the 
Self-evaluation, the more negative the evalua- 
tion of others (ie. a positive correlation). 
Furthermore, assuming that the appraisals 
received from the well-adjusted derogator are 
More important than those received from the 
Maladjusted derogator, then, on the one 
hand, from self-consistency theory one would 
expect a more negative correlation in the 
adjusted other condition than the maladjusted 
Other condition, The evaluations of the ad- 
Justed other presumably are more accurate, 
and consistency with regard to self-ratings 1s 


TABLE 2 
SUMMARY OF STUDIES 


Status of evaluations Theory 
Author(s) from others supported 
Backman & Secord | Estimated by subject | Consistency 


Solomon Controlled by experi- 
menter 

Controlled by experi- 
menter 

Controlled by experi- 
menter 

Controlled by e 
menter 

Controlled by experi- 
menter 

Controlled hy experi- 
menter 

Controlled by experi- 
menter 

Controlled by experi- 


Consistency 


) 
Dittes (1959) 


Esteem 


Dutton & Arrowood 
(1971) 
Harvey (1962) 


Berscheid, & 
1971) 


& Da a 
Jones (1966) Esteem 


Jones, Knurek, & Esteem 
Regan (in press) menter 
Newcomb (1956) Estimated by subject 
Potter (1970) Controlled by experi- 
menter 
Controlled by experi- 
menter 
Controlled by experi- 
menter 
Walster (1965) Controlled by experi- 
menter 
Wiest (1965) Estimated by subject 
Wilson (1965) | Controlled by experi- 
menter 


Rosenberg (1965) 


Skolnick (1971) 


correspondingly more important. On the other 
hand, from self-esteem theory one would ex- 
pect a more positive correlation in the ad- 
justed other condition than the maladjusted 
other condition. The derogation of the well- 
adjusted peer is more damaging to self-esteem 
needs and should have more negative impact 
on the person with low expectancies about his 
value for others. 

The correlational data support the self- 
esteem analysis. First, in the adjusted dero- 
gator condition, the relation between ratings 
of self and other is highly positive (.84) and 
corresponds in direction to the results ob- 
tained by Jones (1966) and Harvey (1962). 
Second, this correlation is significantly more 
positive than in the maladjusted derogator 
condition where, in fact, the correlation is 
strongly negative (—.62). 

One final piece of supporting evidence for 
the self-esteem position comes from a study 
by Potter? who manipulated subjects’ 
appraisals of their own social sensitivity as 
well as whether or not they were liked by 
their partners. Although subjects were more 


2D, A. Potter. Accuracy and interpersonal attrac- 
tion. Unpublished doctoral dissertation, Cornell Uni- 
versity, 1970. 
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attracted to their partners when their part- 
ners liked them as compared to when their 
partners disliked them, this tendency was sig- 
nificantly greater for low-self-evaluators than 
for high-self-evaluators. 


Summary of Evidence 


The implications of these studies, which 
may be viewed as "critical tests" between 
the two theories, are summarized in Table 2. 
Тһе table also summarizes the status of 
the evaluations subjects experienced írom 
others—that is, whether these evaluations 
were estimated by the subject or experimen- 
tally controlled. Two aspects of the table are 
worth noting. First, the evidence, in general, 
tends to favor self-esteem theory over selí- 
consistency theory, Of the 16 investigations 
reviewed, 10 support self-esteem theory, and 
the author has suggested that there are seri- 
ous problems of interpretation or replication 
with the experimental studies often cited as 
Support for self-consistency theory. Further- 
more, the self-esteem evidence becomes more 
impressive when one considers the variety 
of different procedures and measures used in 
these studies. However, at this point, to draw 
any firm conclusions about the relative value 
of self-esteem theory would be premature. 
Some data do seem to support self-consistency 
theory, and the question is whether these 
data can be accounted for within an expanded 
self-esteem theory framework. 

A second feature of Table 2 is that all 
three studies in which peer evaluations were 
estimated by the subject support self- 
consistency theory, whereas 10 of the 13 
studies in which these evaluations were in 
some way controlled by the experimenter sup- 
port self-esteem theory. This pattern of re- 
sults suggests that a subject may be con- 
fronted with quite different problems depend- 
ing upon whether he must estimate another 
person’s feelings or is actually informed of 
those feelings as they relate to some aspect 
of himself. This difference is discussed later 
on in the paper. 


TOWARD A RESOLUTION 


People often comment on the “counter- 
intuitive” quality of many predictions from 
Cognitive consistency theories, and some of 
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the implications of self-consistency theory are 
subject to this general bafflement. Many 
authors working primarily within a cognitive 
consistency framework have acknowledged the 
discrepancies between cognitive consistency 
motivations and self-esteem, self-enhance- 
ment, or even achievement motivations, and 
have expressed such reservations. Rosenberg 
(1968), for example, observed that one of 
the most important failings of the consistency 
approach Ы 

seems to be the experimental affirmation of this 
hard, intractable fact: the need for (or trend 
toward) the maintenance of internally consistent 
affective cognitive structures is often subordinated to 
man's penchant for trying to think well of himself 
and optimistically of his prospects |p. 3841.” 


In response to this dilemma, some authors 
have viewed esteem motivations as “їп 
addition to” forces affecting behavior in 
experiments designed to demonstrate self- 
consistent responses (e.g., Aronson and Carl- 
smith, 1962; Pepitone, 1964) whereas others 
have tried to expand theories of consistency 
to account for self-esteem tendencies (e.g. 
Secord & Backman, 1965). 

The author's own orientation to this prob- 
lem was to ask whether apparently selí- 
consistent actions and attitudes might not 
be explicable solely in terms of self-esteem 
theory. In particular, two variables which 
may be viewed as extensions of this latter 
theory—the anticipation of future conse- 
quences from accepting or rejecting the evalu- 
ations of others, and the degree of personal- 
istic focus attributable to these evaluations— 
proved to be useful in accounting for a vari- 
ety of results in this area of self- and inter- 
personal evaluations. 


Consequences of Accepting or Rejecting 
Evaluations 


As one alternative interpretation of self- 
consistent responses, consider the possibility 
that they reflect a tendency for people to 
react to evaluative information in terms not 
only of its present but also its future implica- 
tions for esteem enhancement. In general, 
people prefer to accept approval from others 
and to reject disapproval. However, there are 
numerous occasions on which a person's ac- 
cepting inconsistent and inaccurate evalua- 
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tions for himself has important and poten- 
tially hazardous consequences for his future 
decisions, responsibilities, and performances, 
and these consequences can be as frightening 
for the low-self-evaluator who is overrated as 
they are humiliating for the high-self-evalu- 
ator who is underrated. In addition to the 
failures and limited successes of his actual 
performances, the individual also may suffer 
social embarrassment from misrepresenting 
himself or from the exposed discrepancy 
between others’ expectancies and his actual 
effectiveness. 

These anticipated successes or failures and 
expressions of approval or disapproval which 
are contingent on the individual's accepting 
evaluations of himself can be satisfying or 
frustrating to his need for self-esteem en- 
hancement. Hence, this position assumes that 
some apparently self-consistent responses can 
be viewed in terms of people forfeiting im- 
mediate gratifications of esteem needs in 
anticipation of more self-enhancing or less 
self-derogating experiences in the future. 

Two classes of conditions have been identi- 
fied which are assumed to increase the 
lience of the future consequences of inter- 
personal evaluations: predecisional conditions 
in which a person anticipates making choices 
for himself which involve his self-evaluation, 
and self-exposure conditions in which a person 
anticipates the public revelation of his actual 
successes or failures. The general hypothesis 
Suiding the research described below is that 
the anticipation of making a personal decision 
or of self-exposure will increase the tendency 
of the individual to make apparently self- 
Consistent responses in his relations with 
Others, 

In a study of the effects on interpersonal 
evaluations of anticipating personal decisions, 
Jones and Ratner (1967) induced all sub- 
lects to adopt negative evaluations of their 

Personality inference” ability by giving them 
false low scores on each of a series of four 
Personality tests. In a second part of the 
Experiment three subjects at a time partici- 
Dated in a group session in which they evalu- 
ated each other's responses to items like Lini 
Appearing ity tests. The evalu- 
ations pay پو ن‎ ete و‎ 
trolled ‘ect received mostly 

so that each subject 


positive evaluations from one peer and mostly 
negative evaluations from the other. Before 
actually beginning this second phase of the 
experiment, subjects were told that there 
would be a third part of the study in which 
they would be required to use this “personal- 
ity inference" ability in a case analysis which 
would be evaluated by a clinical psychologist. 
Halí the subjects immediately chose how 
difficult a clinical case they wished to work 
on (postdecision condition), and, as expected, 
their choices were from among the easier 
cases, The remainder of the subjects antici- 
pated making their choices aíter the group 
session (predicision condition). 

The argument made by Jones and Ratner 
was that prior to making a decision for him- 
self, a subject would prefer accurate, self- 
consistent information from others, but that 
after making the decision as to clinical ta: 
he would, in effect, be protected from the 
hazardous consequences of accepting erro- 
neous evaluations and would prefer informa- 
tion providing more esteem gratifications, 
that is, positive evaluations. The results 
showed that in the postdecision condition 
subjects evaluated the responses of the posi- 
tive evaluator more favorably than the nega- 
tive evaluator but that in the predecision 
condition subjects more favorably evaluated 
the responses of the negative evaluator. 

Three experimental investigations have 
been conducted which relate to the anticipa- 
tion of self-exposure. A study by Jones and 
Pines (1968) used a procedure very similar 
to the study just described. Once again, sub- 
jects were given negative feedback on the 
"personality inference" tests and, subse- 
quently, during a group session received 
mostly positive evaluations from one peer and 
mostly negative evaluations from the other. 
One further manipulation involved self- 
exposure. Half the subjects were led to be- 
lieve that following the group session they 
would once again give their answers aloud 
but the correctness of their answer would be 
indicated publicly by the experimenter. Thus, 
in this condition subjects expected to have 
their level of skill (which they thought was 
low) revealed to their peers. The other half 
of the subjects were not led to expect such 
self-exposure. 
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As predicted and in support of the theo- 
retical analysis presented above, subjects 
expecting self-exposure increased the favor- 
ability of their appraisals of the negative 
evaluator relative to the positive evaluator 
as the time of the experimenter's public 
evaluation approached, and those not expect- 
ing self-exposure became increasingly more 
favorable to the positive evaluator. Further- 
more, subjects in the self-exposure condition 
reported that the negative test evaluation 
from experts was a more meaningful indica- 
tion of their abilities, whereas those in the 
no self-exposure condition found the rela- 
tively more positive evaluations of the peers 
(in combination) were the better indicators 
of their ability. 

An unpublished dissertation by Hara- 
guchi? was a conceptual replication of the 
Jones-Pines study but employed a quite dif- 
ferent procedure suggested by Jones’ (1964) 
analysis of self-presentation. Subjects were 
led to believe from a highly qualified source 
that they would do relatively poorly on an 
achievement test. However, the subsequent 
estimate of an inexperienced clinical inter- 
viewer was that they would do very well on 
the test. The prediction was that subjects 
would present themselves in a more modest 
fashion to the interviewer under self-exposure 
conditions in which they expected to discuss 
their test scores with the interviewer as 
compared to the no-self-exposure conditions. 

Although the main analysis of the self- 
presentation data revealed no significant ef- 
fect of the self-exposure variable, Haraguchi 
found in an internal analysis that the results 
were very much affected by the perceived 
attractiveness of the interviewer. She reasoned 
that subjects would be more likely to avoid 
embarrassment and disapproval to the extent 
that they favorably evaluated their inter- 
viewers. The results of this internal analysis 
showed that for the half of the subjects who 
were most attracted to their interviewers the 
results were in accordance with her initial 
prediction; that is, those subjects in the 
self-exposure condition evaluated themselves 


t 3R. S. Haraguchi. Interpersonal conditions affect- 
ing self presentation. Unpublished doctoral disserta- 


tion, State University of New York at Buffalo, 
1967. 


less favorably (and more consistently with 
respect to their self-evaluations) than those in 
the no-self-exposure conditions. These results 
were significantly reversed for the remainder 
of the subjects who were least attracted to 
their interviewers. 

Finally, Eagly and Acksen (1971) con- 
ducted a study which may be viewed within 
the present framework, In their experiment 
subjects received notes from trained graduate 
students indicating that they were either more 
creative or less creative than they had evalu- 
ated themselves. Subsequently, changes in 
self-appraisals were measured either under 
two conditions in which subjects expected to 
take a creativity test in which their degree 
of success would be obvious to themselves and 
the graduate student administering the test, 
or under a condition in which no tests were 
expected. The authors obtained an interaction 
between expectancy of testing and favor- 
ability or unfavorability of the student’s note. 
Only when subjects received a favorable note 
and expected no test did they change their 
self-evaluation in the direction of being more 
creative. Furthermore, in the favorable note 
conditions subjects changed their self-evalua- 
tions more in the direction of the message 
when they anticipated no test as compared to 
when they did expect the test; the corre- 
sponding comparison for the unfavorable note 
conditions revealed no significant differences, 

In general, the results reported in this 
Section support the notion that apparently 
self-consistent behavior may result from the 
future implication of evaluations for esteem- 
enhancement, Whether people enhance self- 
esteem by accepting positive evaluations of 
themselves or by being more realistic about 
themselves has been shown to be affected by 
their expectancies regarding events which 
could reveal their true worth.* 


Implications of *Personalism" for Self-Esteem 
Enhancement 


There is a second analysis which suggests 
that self-consistent interpersonal evaluations 
may be interpreted with the assumptions of 


1 Jones (1968a) and Mettee (1971) have made a 
similar argument in an attempt to explain apparently 
self-consistent responses in a nonsocial testing situa- 
tion designed by Aronson and Carlsmith (1962). 
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self-esteem theory. Given that an individual 
is liked, approved of, or agreed with by an- 
other person, he probably finds this informa- 
tion more rewarding to the extent that its 
cause is attributable to characteristics of 
himself rather than to some cause which is 
unrelated to himself. For example, thinking 
“Не likes me because of the kind of person 
I am." is undoubtedly more rewarding than 
thinking *He likes me because he's the kind 
of person who likes everybody." By the same 
token various sorts of negative evaluations 
may well be increasingly more damaging to 
self-esteem needs when the individual sees 
himself as the cause of the evaluations as 
compared to when the cause is externally 
attributed. 

This analysis draws heavily upon Jones 
and Davis (1965) concept of personalism. 
In line with their thinking, I assume that to 
the extent that actions (such as interpersonal 
evaluations) are attributable to, or “uniquely 
focused” on the self, those actions have more 
impact on the self-esteem needs of the indi- 
vidual. Thus, to the degree that positive 
evaluations are perceived as focusing on the 
self, esteem needs are satisfied; and to the 
degree that negative evaluations are focused 
in this manner, esteem needs are frustrated. 

An important variable affecting personal- 
istic attributions may well be the similarity 
Or consistency existing between the indi- 
vidual’s evaluation of himself and the evalu- 
ations he receives from the actor. When the 
individual and another party have similar 
Opinions about an entity, there is at least the 
Possibility that the other person is focusing 
оп, and being directly affected by the same 
events as the subject in question (cf. Kelley, 
1967). 

1f seli-consistency produces the attribution 
Of personal cause, then this analysis, once 
again, leads to the conclusion that self- 
Consistency per se is not a general motiva- 
tional state. Rather, personalism via self- 
Consistency interacts with the esteem en- 

ancing or derogating value of the actions 
directed toward the individual. Thus, the 
hypothesis to be considered is that when an 
individual is accepted or liked by others, the 
More self-consistent their evaluations of his 
characteristics, the more he is attracted to 
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them, but when an individual is rejected or 
disliked by others, the more self-consistent 
their evaluations, the less he is attracted 
to them. 

Two studies were designed to test this 
general notion. In the first study, Jones et al. 
(in press) assumed (and self-report data con- 
firmed this assumption) that subjects would 
have generally favorable ratings of their own 
characteristics and manipulated selí-consist- 
ency by providing the subjects with positive 
(and consistent) trait ratings by their part- 
ners or with negative (and inconsistent) 
ratings. Subsequently, whether a partner 
liked or disliked a subject was varied, and 
measures of the subject’s attraction to his 
partner revealed a significant interaction be- 
tween self-consistency and liking which par- 
tially supported the above hypothesis. When 


E 


subjects were liked by their partners, n 
d 


attraction was greater when they receive 
positive trait ratings rather than negative 
ratings from their partners; when subjects 
were disliked they were slightly less attracted 
when they received positive ratings. Although 
alternative interpretations of these findings 
were implausible in the light of other data 
obtained in this study, a more substantial test 
of the personalism hypothesis requires a 
direct manipulation of self-consistency. 

An unpublished dissertation by Potter 
(see Footnote 2) permits such a test. Subjects 
were first given two written tests of their so- 
cial sensitivity and received test feedback in- 
dicating that they had either a high level of 
skill or a low level of skill. Then they par- 
ticipated in a structured “social sensitivity 
interview" in the presence of a partner who 
subsequently evaluated them as being either 
well above average or well below average in 
social sensitivity. Since there was evidence 
that the test feedback affected self-evalua- 
tions, the test and interpersonal evaluations 
of social sensitivity, which constituted a 
2 X 2 design may be considered in conjunc- 
tion as a manipulation of self-consistency. 
Evaluations which were similarly high or low 
are consistent, but evaluations which were 
dissimilar are inconsistent. Finally, on an ad- 
ditional evaluation form subjects were in- 
formed that their partners either liked or 
disliked them. 
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ATTRACTION TO. PARTNERS As А FUNCTION 
ОЕ SELF-CoNsISTENCY AND LIKING 


Self-consistency | Likes | 


Dislike 
| = 
Consistent 2.34 | 4.05 
Inconsistent 3.06 3.44 

a The lower the number, the greater the subject's liking for 


his partner. 


The major dependent variable of this study 
Was a question designed to assess how much 
a subject liked his partner. If self-consistency 
is a general motivational tendency, then one 
would expect that regardless of whether they 
were liked or disliked by their partners, sub- 
jects would be more attracted to them in the 
consistent conditions than the inconsistent 
conditions. If, on the other hand, the analy- 
sis of personalism is correct, subjects in the 
like condition would be attracted to partners 
in the consistent condition more than 
partners in the inconsistent condition, but 
Subjects in the dislike condition would like 
consistent partners less than inconsistent 
partners. As shown in Table 5, the pattern 
Of results is precisely as predicted by the 
personalism analysis, and the interaction be- 
tween self-consistency and liking is highly 
significant (5 < .001). 

The results of these studies provide support 
for the assumption that personalism via self- 
Consistency increases the impact of others’ 
feelings on the individual's self-esteem needs, 
People do prefer self-consistent evaluations 
from others when they feel that they are 
accepted or liked by others, However, the 
fact that inconsistent evaluations are pre- 
lerred when the individual finds himself dis- 
liked Strongly indicates that esteem enhance- 
ment and not maintenance of stable cognitions 


is the motivational force operating in these 
social relations, 


reported earlier in the Paper. Backman and 
Secord (1962) and Newcomb (1956) showed 
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support self-consistency theory, a self-esteem 
interpretation now becomes possible. To make 
this interpretation meaningful it is necessary 
to consider how much the individual felt he 
was liked by each of his peers. In these survey 
studies of ongoing relations, it is probably 
safe to assume that the individual’s liking for 
others is highly correlated with how much 
others like him or at least with his perception 
of their liking for him (see, e.g., Tagiuri, 
Bruner, & Blake, 1958, for evidence support- 
ing this assumption). From this assumption 
follows the possibility that the Backman and 
Secord, and Newcomb findings indicate that 
the more the individual likes and thinks he 
is liked by another person, the more he 
attributes similarity with regard to opinions 
about himself, This tendency falls into line 
with the pattern of results found by Potter. 
It suggests that self-consistent opinions are 
attributed to friends to enhance the value of 
their affection whereas dissimilar opinions 
tend to be attributed to enemies to defend 
against the sting of their rejection, 


“Warm” and “Cool” Studies of Social 
Evaluation 

The analyses of futur 
personalism suggest the conclusion that a 
variety of self-consistent results may be 
explained within the framework of self-esteem 
theory and that cognitive consistency motiva- 
tions play at best a minor role in interper- 
sonal evaluations of the sort discussed here. 
However, there may be limitations to the 
generalizability of this conclusion to some 
kinds of investigations of social perception 
and evaluation. There are a number of studies 
(e.g., Aronson & Cope, 1968; Jordan, 1953) 
Showing that people tend to hold consistent 
or balanced cognitions with respect to their 
evaluations of others. Indeed the correlational 
studies reviewed earlier (Backman & Secord, 
1962; Newcomb, 1956; and Wiest, 1965), 
although open to alternative interpretations, 
show similar consistency effects, A major dis- 
tinction which may be useful in those studies 
which obtain results supporting cognitive 
consistency theory from those supporting self- 
esteem theory involves whether or not the 
experimental subjects are themselyes targets 
of evaluative action from the stimulus person 


e consequences and 
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being evaluated. Experiments in which the 
subject is liked or disliked, agreed or dis- 
agreed with, benefited or harmed, or in other 
ways directly affected by the stimulus person 
may be considered “warm” studies. Since 
these subjects are to some extent the focus 
of actions having evaluative implications, 
Such studies should be interpretable within 
the self-esteem theory framework. However, 
experiments in which the subject is in no way 
personally affected by the stimulus person 
may be viewed as “cool” studies. In the lit- 
erature on person perception there are many 
Such investigations in which the subject ob- 
Serves but does not interact with some staged 
slice of life involving other people, then is 
asked to form impressions of these others 
based on his observations. Since there is no 
direct personal involvement and the orienta- 
tion of the subject is essentially clinical or 
diagnostic, these studies may well be more 
explicable in terms of cognitive consistency 
theory, 

This distinction is helpful in accounting 
for the pattern of results reported previously 
In Table 2, Those studies in which the sub- 
Ject estimates the reactions of others are rela- 
tively cool because the subject is neither 
Interacting with, nor receiving evaluative in- 
formation from others. Hence, his esteem 
needs are of less concern to him than the 
Cognitive task of making sense of the social 
World he watches, and, indeed, his attitudes 
in these situations do tend to be self- 
Consistent, Those studies in which the subject 
receives evaluations from others are warm 
Studies, The subject learns what others actu- 
ally think of him, he worries about his own 
€steem and worth, and, as the data indicate 
İn most cases, he is differentially satisfied or 
frustrated depending upon others’ evaluations 
and his self-evaluation. 

The variable differentiating warm and cool 
Studies—whether or not the subject is the 
cus of evaluative actions from the target 
Person—is easily manipulable, and the idea 
that warm situations will lead to esteem 
heory results whereas cool situations will 
ead to consistency theory results should be 
*Xperimentally tested. In addition, systemati- 
Cally applying this distinction to other studies 
N person perception should be undertaken 


before drawing any firm conclusions. If the 
difference proposed turns out to be meaning- 
ful, there is an interesting implication for 
the person perception literature in general; 
namely, the popularity of consistency theories 
in this area may be in part at least a function 
of the many studies conducted in which the 
subject is required to comment upon or evalu- 
ate a social game that he observes from the 
sidelines but in which he rarely participates. 
Furthermore, if living with others is a matter 
of playing the game rather than watching it, 
then the generality of consistency models to 
real life situations may be quite limited, 


SUMMARY AND CONCLUSIONS 


The purpose of this paper was to compare 
two rather broad theoretical traditions in so- 
cial psychology as they related to the determi- 
nants of interpersonal evaluations. On the 
one hand, proponents of self-consistency 
theory argue that the individual adjusts his 
cognitions and orients his relations with 
others so as to maintain similarity between 
his evaluations of himself and those he re- 
ceives from others. On the other hand, the 
assumptions of self-esteem theory are that the 
individual has a need for self-esteem which 
is satisfied primarily by the approval he re- 
ceives from others and is frustrated by their 
disapproval, and that to the extent that his 
esteem need is satisfied by others’ evaluations, 
the individual will respond favorably to them. 

In spite of the fact that the cognitive con- 
sistency approach has been widely accepted 
as a theoretical framework for these problems 
as well as many other problems in social 
psychology, I suggest that a stronger case can 
be made for the value of the self-esteem 
theory outlined in this paper. The evidence 
from experiments which permit a test between 
these positions provides more substantial sup- 
port for the self-esteem predictions. Time and 
again, the unhappy self-derogator seems to 
glow when praised and glare when censured 
even more than his self-confident counter- 
part. Furthermore, two rather straightforward 
extensions of self-esteem theory are capable 
of explaining apparently self-consistent inter- 
personal evaluations. First, the anticipated 
exposure of his qualities tends to make the 
individual more “honest” about himself (and, 
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therefore, more self-consistent) so as to avoid 
the eventual disapproval of others, 
people respond more favorably to positive 
evaluations and less favorably to negative 
evaluations to the extent that those evalua- 
tions are perceived as uniquely focused on 
themselves. Not only can these Propositions 
account for some of the data often cited as 
support for a self-consistency motivation, but 
they also point in the direction of sorting out 
self-consistent responses from other strategies 
designed to gain esteem enhancement, 
Whether or not these conclusions have 
implications for Other social psychological 
problems involving cognitive consistency 
models remains to be seen. Bramel (1968) 
has recently suggested that certain cognitive 
dissonance phenomena may reflect underlying 
needs for self-esteem enhancement through 
Social approval, He argues that “dissonance 
isa feeling of pe 
of anxiety) traceable to rej 


Second, 


and that the ‹ 
anxiety about one’s w 
Such things as selí-justification and the search 
for information that will reflect favorably 


upon the self Ip. 365|.” Deutsch, Krauss, 
and Rosenau made a similar 


similarities and dif- 
ious lines of analy- 
clearly articulated 
as sharpen the dis- 


tinction between it апа cognitive consistency 
theories, 


‘consequences of 
Orth are likely to be 


EPILOGUE 


I am good 
You love me 
therefore you are good, 


Iam bad 
You love me 


therefore you are truly beautiful! 
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COGNITIVE FACTORS IN ELECTRODERMAL CONDITION 


WILLIAM W. GRINGS 2 


University of Southern California 


An analysis is made of concepts central to explanation of cognitive determina- 
tion of electrodermal changes in conditioning situations. Past issues have 
emphasized verbally induced task sets, awareness of stimulus contingencies, 
and the possibility of simultaneous learning at different "levels." Current 
issues emphasize verbally determined changes in the perceptual significance 
of stimuli and changes in perception of relations among stimulus events. In 
conditioning situations, electrodermal activity is seen as a sensitive indicator 
of the momentary modulations oí arousal which accompany processes de- 
scribed by the terms of orientation, anticipation, expectation, preparation, 
and signal-induced adaptation or defense. 


After numerous early studies demonstrated 
the feasibility of conditioning the electro- 
dermal response, that response was adopted 
enthusiastically for studies of conditioning 
because it was assumed to be involuntary 
and free from the influence of subject verbali- 
zations. By the time of publication of Hilgard 
and Marquis’ (1940) summary of the field, 
a large number of traditional phenomena had 
been demonstrated with the electrodermal 
response. To mention a few, they included 
inhibition of delay and disinhibition, com- 
parison of trace and delay paradigms, stimu- 
lus generalization, spontaneous recovery, par- 
tial reinforcement effects, compound stimulus 
patterning, and semantic generalization. 

During the same period, there was an 
increasing concern for the fact that at least 
some of the observed results reflected phe- 
nomena more like verbal expectancy learn- 
ing (Cook & Harris, 1937) or preparatory 
(cognitive) sets for receipt of stimulation 
(Mowrer, 1938) than phenomena of “true 
conditioning.” As efforts were made to repli- 
cate and extend upon the earlier findings, 
researchers on electrodermal conditioning 
found themselves in the middle of a number 
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of theoretical-methodological issues involving 
cognition for the electrodermal response is 50 
sensitive to shifts of attention and the mo- 
mentary changes in arousal which accompany 
changes in cognition that it is impossible to 
eliminate cognitive factors completely from 
conditioning observational situations, At the 
same time, this sensitivity gives the electro- 
dermal response an advantage when it comes 
to direct study of cognitive variables. 

The present material is an examination of 
the role the electrodermal response has played 
in the study of cognitive factors in condition- 
ing. It extends upon an earlier discussion 
(Grings, 1965). An arbitrary selection. and 
classification of researches is made as a basis 
for examination of a few central issues. 
Where a choice of research paradigms is re- 
quired, the paired-stimulation arrangement of 
classical conditioning is given preference over 
paradigms of avoidance or escape learning, 
instrumental reward learning, and operant 
conditioning. The restrictions are not made 
io suggest a discontinuity among kinds of 
learning situations, but rather to gain simplic- 
ity of explanation for a limited range of 
phenomena. It is also true that the classical 
conditioning paradigm has been the most 
widely researched stimulation arrangement 
with the electrodermal response. 


BACKGROUND FOR CONSIDERATION OF 
COGNITIVE DETERMINERS OF 
CONDITIONING 


The first clear indications of cognitive 
learning of autonomic responses were reported 
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over 30 years ago. Cook and Harris (1937) 
manipulated the electrodermal response to a 
“neutral” stimulus by instructing subjects 
that the stimulus would be followed by a 
shock. They labeled the phenomenon “verbal 
conditioning" involving “ап association be- 
tween two verbalizations: for example, the 
verbalization ‘light’ and the verbalization 
‘receive a shock’ [p. 209].” They asserted 
further that “conditioning of the galvanic skin 
response in the human adult differs from the 
customary conditioning procedure [p. 209].” 

These early observations were put aside by 
many workers, and electrodermal condition- 
ing continued to be carried out assuming that 
subjects could be kept from making verbal 
associations by giving them instructions in- 
tended to mislead them about the purpose of 
an experiment, The results of such verbal fac- 
tors were considered to be artifacts and not 
really conditioning (e.g., Mowrer, 1938). One 
notable exception to this arose from an inter- 
est in practical questions of extinguishing 
undesirable conditioned autonomic responses 
Where conventional extinction procedures 
Were compared to what might be termed 
extinction plus knowledge" (e.g., Haggard, 


1943), 


EARLY STUDIES OF COGNITIVE FACTORS 
IN EXTINCTION 

In the previously cited study, Haggard 
(1943) investigated a variety of cognitive 
Variables affecting skin resistance phenomena 
in a situation associating shock with a verbal 
Stimulus, For some subjects the shock was 
administered by the experimenter; for others 
it was self-administered. Groups were later 
divided on whether they were aware of the 
Word-shock relation. Of three types of ther- 
ару conditions compared after conditioning, 
Опе involved permitting the subject to ask 
Questions, where “an invariable question was 
whether they would be shocked again” to 
Which the answer “no further shocks” would 
зе given. The results were quite complicated 
" cause of the wide variety of variables in- 
SLE, but several conclusions were evident. 
f e was that cognitive variables have an 
“portant effect on general reactivity (moti- 
vation, 4 hat the in- 


arousal) variables and t : 
while 


uenc "NT 
ence of cognition upon extinction, 


present, depended upon other variables (like 
task requirements). 

This difference between associative and 
motivational effects of instructions during 
extinction of the electrodermal response is 
characteristic of later studies. Silverman 
(1960) observed short (.5-second) and long 
(6-second) interstimulus-interval condition- 
ing groups and a pseudoconditioning control 
group when half of each group was extin- 
guished with “no more shock” instructions. 
Knowledge from instructions affected both the 
5-second and the pseudoconditioning group 
performance ( produced a decrement) whereas 
the 6-second group was not affected. The re- 
sult was interpreted as reflecting anxiety 
produced by the longer interval. 

Wickens, Allen, and Hill (1963) compared 
extinction with and without such “no more 
shock” instructions with two levels of un- 
conditioned stimulus (UCS) intensity. The 
two control groups (instructed and non- 
instructed) varied in response magnitude 
showing that one effect of instructions was 
to lower drive, Instructions speeded extinc- 
tions but did not produce it immediately. 
There was no clear effect of shock intensity. 
In a related study, Grings and Lockhart 
(1963) found a performance decrement at- 
tributable to instructions but no difference 
due to UCS intensity. The absence of an 
unpaired group made it impossible to separate 
motivational and associative effects of the 
knowledge about stimulus relations. 

Because many investigators use extinction 
measures as indices of conditioning acqui- 
sition, the role of instructions upon extinc- 
tion can be evaluated in a number of studies 
of related issues, like those of awareness and 
levels of learning, to be noted later (e.g., 
Bridger & Mandel, 1965; Fuhrer & Baer, 
1969; Wilson, 1968). 


Task SET 


Traditionally one of the most frequent 
interpretations of cognitive influences in 
conditioning has been in terms of facilitating 
or inhibiting effects upon performance 
through the subject’s set toward his task. 
For eye-blink-conditioning situations these 
results were extensively discussed — (e.g., 
Hilgard & Humphreys, 1938; Norris & Grant, 
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1948). The general results have been clear 
evidence of inhibition and facilitation by 
instructional variables. 

Two recent articles reporting on the 
electrodermal response illustrate this type of 
concern. Hill (1967) discussed conditioning 
with her subjects, then told half of them that 
"the inteligent thing is to become condi- 
tioned,” while telling the other half that “the 
intelligent thing is to not become condi- 
tioned." The data revealed differential re- 
sponding in accordance with instruction, the 
favorable group responding at a consistently 

higher level than the antiinstruction group. 
The author viewed the instruction effect as 
predominately motivational. Dawson and 
Reardon (1969) repeated Hill's instructions 
and added a control group and a neutral 
instruction. group. Both the inhibitory and 
facilitory group surpassed the control 
group, but neither differed significantly from 
a neutral-instruction-conditioning group, al- 
though the facilitation group performed sig- 
nificantly better than the inhibitory group. 
They also found differential unconditioned 
response (UCR) habituation in the two con- 
ditions (inhibiton and facilitation). A later 
study (Harvey & Wickens, 1971) suggested 
the possibility that the superior performance 
of the “pro” group may be due to a non- 
associative (drive) variable. 


AWARENESS 


A frequently researched question during 
the past two decades concerns the concept of 
awareness and the role it plays, possibly as 
a necessary condition to autonomic learning, 
Put in its strongest form this argument 
asserts that all autonomic conditioning is 
verbal expectancy learning and that aware- 
ness of conditioned stimulus (CS)-UCS 
relations is necessary for conditioning to 
occur. Experiments to that end for both 
electrodermal response and heart rate condi- 
tioning are typified by those of Chatterjee 
and Ericksen (1960, 1962). Numerous dif- 
ficulties stand in the way of proving that 
awareness of CS-UCS relations is necessary 
for autonomic conditioning to occur. Tt be- 
comes important to demonstrate an absence 
of conditioning when there is an absence of 
awareness. Given the absence of conditioning 
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under unaware circumstances, it is always 
possible that the other necessary conditions 
ior learning are also not present (like per- 
ception of, or attention to, stimuli; and an 
adequate number of trials for learning to 
occur). The experiments tend more to prove 
that cognitive awareness factors are sufficient 
and potent than that they are necessary. 

Some of the difficulties in this type of re- 
search can be seen from a brief look at a 
sample of studies. In early investigations, the 
variable of awareness was defined by sepa- 
rating subjects into aware and unaware 
groups on the basis of postexperimental inter- 
views (e.g Diven, 1937; Lacey, Smith, & 
Green, 1955). There are many weaknesses in 
this procedure. Potential sources of bias exist 
during the interview and the locus of the 
information collection makes it possible that 
learning could occur first, then produce the 
awareness. Another approach collects inter- 
trial reports and attempts to verify the propo- 
sition that learning occurs on the trial on 
which awareness occurs. Such a procedure 
has been used successfully in studies of verbal 
conditioning (i.e., nonautonomic) by DeNike 
(1964), but efforts to use the method with 
the electrodermal response have not been 
fully successful (Fuhrer & Baer, 1965; 
Shean, 1968), requiring the investigators 10 
base their principal arguments upon post- 
experimental accuracy in reporting CS-UCS 
relations, 

Manipulative definitions of awareness have 
been of two main types. One, based on dif- 
ferences in instructions about stimulus rela- 
tions, has already been noted above to en- 
hance performance but to be difficult to 
evaluate in the opposite direction, that is, in 
terms of preventing awareness. There İS 
nothing in misleading or disguising instruc- 
tions to guarantee that a subject will not 
catch on to the stimulus relations. As a result, 
a second form of manipulation, the use of а 
masking task, has recently seen emphasis. Jn 
an early evaluation of the effect of a masking 
task in electrodermal response conditioning, 
Dawson and Grings (1968) administered 
conditioning stimuli while subjects were 
taking a paper-and-pencil test. Their subjects 
showed no differential response to reinforced 
and nonreinforced stimuli until they were 
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instructed of stimulus relations. However, the 
number of trials were few and the possibility 
existed that subpects were distracted by the 
task to а degree that might keep them from 
perceiving the stimuli. 

To overcome the weakness just mentioned, 
Dawson (1970) devised a masking task in 
Which the subject made discriminative judg- 
ments among members of a series of tone 
stimuli, and reinforcement conditions were 
introduced by pairing shocks with one of the 
tones, Further manipulation by instructions 
showed conditioning only when awareness was 
introduced, In a related study, Fuhrer and 
Baer (1969) employed differential electro- 
dermal response conditioning in a group who 
had conditioning embedded in a masking 
task (probability matching) and a compari- 
Son group who underwent conventional condi- 
tioning, Content analysis of postexperimental 
interviews showed that 62% of the subjects 
in the conventional conditioning group accu- 
rately verbalized CS-UCS relations while only 
30% of the masking-task group did. The 
Matching-task group showed poorer condi- 
tioning, although in both groups conditioning 
Occurred, The last study demonstrates the 
effectiveness of the masking task in manipu- 
lating awareness, but it also suggests the 
limitations of the procedure in eliminating 
awareness, 


LEvkLS OF CONDITIONING 


A significant issue in conditioning theory 
has been concerned with the possibility that 
€arning may occur at more than one level, 
And that perhaps behavior change may be 
Soing on at more than one level at the same 
time. Such a possibility had been suggested 
by Razran (1955) when he spoke of condi- 
Erg with perception, for example, percep- 

n of stimuli, and perception of relations 
etween stimuli. 
"pos Razran (1961) made clear that he 
occu Dat primitive forms of conditioning may 
Occur without awareness and may serve as 
ae for some unconscious process. iHe 

ended these views (1965, 1971) to what 
Шы nel evolutionary levels of learning. is 

"Unctions made here bear resemblance to 
- difference proposed by Pavlov between 
>t and second signal system conditioning, 
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the first level involving direct sense impres- 
sions and the second involving symbolic and 
meaningful stimuli. It was implicit in Pavlov's 
theorizing that learning with the second signal 
system may take precedence over, inhibit, or 
at least interact with learning of the first 
signal system. 

One of the arguments favoring a separa- 
tion of types of learning was made by 
Bridger and Mandel (1964). They measured 
the electrodermal response of two groups of 
subjects. Both groups wore electrodes for 
receiving an electric shock and both were told 
that a shock would be delivered after a par- 
ticular signal stimulus. Then, even though 
one group was given the shock and the other 
had only the threat of shock, both groups 
gave electrodermal responses to the signal 
stimulus that were greater than those given 


` to the control stimulus. This was viewed as 


evidence for conditioning. Then both groups 
were told that there would be no more shocks 
and another series of signal trails was ad- 
ministered. The fact that the threat group 
extinguished and the shocked group did not 
was taken as evidence that different types of 
learning had occurred—one due to verbal 
íactors only and the other due to actual 
experience with shock. 

Extending upon this work and again using 
electrodermal response, Bridger and Mandel 
(1965) reported a study in which instructions 
to the effect that no more shock would occur 
(accompanied by removal of the shock elec- 
trodes) led to the abolition of a partial re- 
inforcement extinction effect as compared to 
the retention of the effect in a noninformed 
group. The presence of a conditioned electro- 
dermal response during extinction in the in- 
formed group combined with the abolition of 
the partial reinforcement effect was inter- 
preted to indicate a two-component condi- 
tioned response characterized as a simple 
conditioned response and a mediated condi- 
tioned response. 

In a more recent study by Mandel and 
Bridger (1967), six groups of subjects were 
studied, two with a 5-second interstimulus 
interval, two with a j-second interstimulus 
interval, and two with a backward inter- 
stimulus interval. All groups were informed 
that the experiment contained three parts: 
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iod when responses to a light alone 
a кол (preconditioning "s 2 
CS); (b) a period when the light Hrs a 
associated with electric shock; and p 
(c) a period during which responses а т 
light alone would again be recorded. All sub- 


group i 
the experiment continued without 
interruption, A postexperimental inquiry was 
conducted to eliminate those Subjects in the 
informed did not believe the 


Inter: 
stimulus interval groups, the 5-second group 
€monstrated 4 larger differentia] response 
urin 


ur oninformed con- 
dition whereas the 4-second groups ha 


Tesponse under the in- 
formed condition, The backward-conditioned 
response did not demonstrat 


bectancies, 
differentia] 


expectancy was “considered be evi 


a level of GSR conditioning analogous to 
Owrer’s (193 | 


Concept of true learni 

[Mandell T, 1967, p, 42],^ е 
angers in drawing the above 

* procedure is simi- 
lar to the difficul y tana 
not occur 

n the latter 
es 
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inferences were made from a lack of bise 
for extinction, and it becomes necessary t 
assume that the subject does not have e 
tive expectation, With human subjects sti. 
form of assumption is difficult to ie E 
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the experimenter told them a falsehood. odd 
is, they might have really expected a 5 oad 
but have answered according to the dem 
characteristics of the situation. tudies 
Tt should also be pointed out that = tà 
in which instruction variation is press often 
be the primary independent variable 1 { other 
Subject to inadvertent confounding instance 
types. An example in the present Fw 
might be disinhibition (or dishabitua ep, 
will be noted that the informed Loa odi the 
interrupted for a period of time ا‎ 
uninformed subjects had no such inte towa 
Disinhibition or changes in attitude 


e 
А А ilv Ir du 
the experiment might readily occ 

interruption, 


he important question, however ions 
Whether this form of experimental A уре | 
tion indicates the existence of two ctancit* 
learning, one based on cognitive ЕРЫП! jt 
and the other а noncognitive contita han js 
is concluded here that the evidence ? resulting 
equivocal. Tt can be explained ed throug 
entirely from learning, developec dified к 
cognitive expectations, which is pm ой 
varying degrees during a rst 
ance (rather than involving cogniti i 
noncognitive processes), The data varia 
strongly, however, that cognitive extinc 
are important in determining € 
performance, 
Wilson 
the “ 
for e 
simpl 


remains 


tio! 


sia tO) 

ye 
гпабу < 

(1968) suggested ап alte n 


i 

‘action instruct” ү 

no more shock” extinction jbutions is- 
valuating the relative сойт. А 


а 
e 3 sing “ er- 
* and cognitive pon La int 


crimination Paradigm and a he subjec" 
stimulus interval, he instructed en ГЕ, 
prior to acquisition and prior i to Ше 
At the beginning of extinction pue ot 
that a discrimination reversal W and -' d 
(e Qs would now be shocked finis ТЕЛ 
would not be shocked). He then 2 “а sudd g 
stimuli Without shock and found n цол, 
reversal of the differential CO 


xe: ing evo 
GSRs of ere: eing 


iter magnitude b 


RE 
Ё 


< 


Z^ 


COGNITION AND ELECTRODERMAL CONDITIONING 205 


CS— |p. 491]. He concluded that th 
"study finds no evidence to support the con 
tention that some part of a conditioned auto- 
nomic response is ‘simple’ in the sense that 
it is not mediated by S's perception of stimu- 
lus contingencies [p. 493].” Subsequent study 
of the reversal procedure (Grings, 1973) has 
shown a clear dominance of cognitive vari- 
ables for both long and short interstimulus 
intervals, and when reversal learning groups 
(negative transfer experience) are compared 
to groups whose previous training was pseudo- 
conditioning (or nontransfer experience). 

Another interpretation about levels was 
made by Fuhrer and Baer (1969) in the 
previously discussed study using a masking 
task. They argued that cognitive factors 
manifest themselves differently in the various 
components (topography) of the electro- 
dermal response. They found that in their 
masking-task group both aware and unaware 
Subjects show significant discrimination of 
first response (responses occurring just aíter 
the CS) but not significant second responses 
(occurring just before the onset of UCS). 
They conclude that they have demonstrated 
unaware conditioning. 


INDIRECT APPROACHES TO “AWARENESS” 
VARIABLES 


Most of the evidence considered thus far 
has relied upon manipulations of environmen- 
tal conditions which are presumed to influ- 
ence the subjects cognitive awareness or 
Derception of the experimental situation. 
Such studies have been most numerous in 
electrodermal response. conditioning. There 
is, however, considerable information from 
Other types of studies—among them observa- 
tions of special populations of verbal-concep- 
tually handicapped subjects, studies using 
Conditioned stimuli administered interocep- 
tively, and research using the electrodermal 
Tesponse as an index of expectation or 
alertness, 

In early efforts to evaluate the role of 
Verbal factors in electrodermal response con- 
ditioning the writer used two special popula- 
tions of subjects who were deficient in verbal 
Capabilities. One of these was composed ol 
totally deaf preschool-age children who, be- 
Cause of their deafness, were not equipped 


with verbal communication skills and who, 
because of age and deafness, would be ex- 
pected to be very unsophisticated about con- 
ditioning (Grings, Lowell, & Honnard, 1961). 
These children showed conditioning as rapidly 
and reliably as college-age normal subjects. 

The second investigation was made of ado- 
lescents confined in a state hospital for the 
mentally deficient. Two samples were chosen. 
One had an average IQ of 63 and possessed 
minimum verbal skills for ordinary communi- 
cation. The other had a mean IQ of 34 and 
was almost completely nonverbal. It was 
anticipated that a difference in rate of acqui- 
sition of an autonomic conditioned discrimi- 
nation would result from a difference in 
verbal ability. This did not occur, for both 
groups conditioned rapidly and reliably 
(Grings, Lockhart, & Dameron, 1962). Simi- 
lar results have been reported by Baumeister, 
Beedle, & Urquhart (1964), whereas Lobb 
and Nugent (1966) and Lobb (1968) report 
an interaction of intelligence and inter- 
stimulus interval. 

Still another avenue into the domain of 
cognitive-autonomic relations exists through 
the distinction between interoceptive and 
exteroceptive conditioning. Most human 
studies employ as conditioned stimuli extero- 
ceptive stimuli like lights and tones. Few 
have used stimuli delivered interoceptively as 
has been done extensively with lower animals 
(e.g, Bykov, 1957). Stimuli to the interior 
of the body have a unique property in terms 
of awareness learning, for such stimuli are 
typically reported as only vague and diffuse 
in conscious experience. 

Uno (1970) conducted a study comparing 
exteroceptive and interoceptive conditioning 
of the electrodermal response and made ex- 
tensive observations of the role of perception 
and awareness. The interoceptive stimuli were 
different temperatures of a water-filled bal- 
loon, swallowed by the subjects and located 
midway in the esophagus. Exteroceptive CSs 
were tones of different frequencies, and the 
UCS was an electric shock. Differential 
conditioning was obtained to both intero- 
ceptive and exteroceptive stimuli. Analysis of 
subgroups in terms of their awareness of 
stimulus (CS) differences or of CS-UCS rela- 
tions showed that unawareness of interoceptive 
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stimuli or their reinforcement contingencies 
tended to eliminate the first response (the 
conditioned response occurring in the delay 
interval) but did not eliminate conditioning 
of the UCS omission response (the condi- 
tioned response occurring on trials where the 
UCS is omitted). Only one response was mea- 
sured in the delay period so that a direct 
comparison cannot be made with the conclu- 
sions of Fuhrer and Baer (1969) about 
awareness and response topography. 

There are other forms of research para- 
digms which have much in common with pre- 
viously discussed manipulations of cognitive 
awareness, but which are typically not con- 
sidered in connection with the above issues, 
possibly because the subject is assumed to 
be aware. Some of these will be noted briefly. 

One is the motor avoidance task in which 
a CS serves as the signal for response. It is 
possible to observe conditioned anticipatory 
electrodermal responses if the time between 
CS and receipt (or nonoccurrence) of noxious 
Stimulus is sufficiently long. Conclusions 
about awareness can be made from the cor- 
rectness of the avoidance response, As an 
example of this type, Grings and Lockhart 
(1966) had subjects operate a four-position 
Switch which was signalled by four visual 
stimuli: one always followed by shock, one 
never shocked, and two associated with 
Switch positions which avoided the shock. 
Successive subgroupings exhibited progres- 
sively better avoidance learning with in- 
creased awareness instruction and showed 
significant decreases in magnitude of antici- 
patory electrodermal responses at the point 
of learning the avoidance contingency. 

A related study (Colgan, 1970) produced 
similar results by making the receipt of 
shock predictable to the subject by instruct- 
ing him about stimulation changes. With 
three distinctive stimuli, each was followed 
equally often by shock during an initial 
period of trials. Then half of the subjects 
were instructed that shock would follow only 
one stimulus and the other half were in- 
structed that they would have a second series 
like the first. The preshock electrodermal re- 
sponse for the instructed groups stayed at a 
high magnitude for the cue still followed by 
Shock and dropped sharply for the other two 
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signals. For the other subgroup no between- 
stimulus differences occurred. 

Any situation in which there is a distinctive 
signal for action to occur after a delay will 
provide electrodermal responses similar to 
those in more conventional delay condition- 
ing. Two outstanding examples are the 
method of Ivanov-Smolensky where the signal 
to execute an instructed command takes the 
place of the conventional UCS (an example 
with electrodermal response conditioning 
would be that of Skorunskaia, 1958); and the 
reaction time task where the warning signal 
for the reaction time task can be made to 
have the properties of a discriminative CS 
(e.g, Baer & Fuhrer, 1969; Martin, 1965). 


Related Theoretical Issues 


The discussion to the present point permits 
the conclusion that in electrodermal condi- 
tioning situations many cognitive variables 
are of theoretical importance. The variables 
which have been identified stress the impor- 
tance of conscious awareness and of verbally 
induced sets upon performance. It is clear 
that. verbal expectancies develop in paired 
stimulation (conditioning) situations, and 
there is strong evidence that electrodermal 
response conditioning changes accompany 
these verbal awarenesses. Whether any learn- 
ing of electrodermal behavior occurs in the 
absence of awareness is an open question. 
Of major importance, is the question of just 
how the cognitive factors operate in deter- 
mining autonomic behavior change. 

At the present time there is no compre- 
hensive miniature theory to account for cog- 
nitive learning of autonomic behavior. There 
are, however, strong links to points of view 
that emphasize sign-significate learning (e.g 
Tolman, 1932) and sequence-learning inter- 
pretations of conditioning (e.g., Grings, 1963; 
Woodworth, 1947) and with learned orient- 
ing and alerting behavior to stimuli (e.g. 
Sokolov, 1963). 

What is central to all of these points of 
view is that the conditioned stimulus serves 
as a signal which informs the subject of the 
appearance of the second stimulus (the 
UCS). Some discussions about the nature of 
electrodermal conditioning (eg,  Grings, 
1960, 1969; Grings & Sukoneck, 1971) em- 
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phasize concepts of perception, set, and ex- 
pectancy based on temporal contingencies 
operating in the individual's environment. 
The autonomic responses are seen as arousal 
correlates to the preparation and expectation 
process, This suggests the organization of 
concepts underlining such learning in terms of 
perceptual variables of at least two main 
types: perception of stimuli and perception 
of relations or contingencies. 

Paired stimulation experiences are assumed 

to change perceptions of the stimuli involved. 
A signal stimulus may take on the proper- 
ties of the stimulus which it signals. Verbal- 
izable properties of a conditioned stimulus 
appear to show alteration due to association 
with the UCS. This may occur with non- 
symbolic as well as with symbolic stimuli. As 
an example of such change in nonsymbolic 
stimuli, one might note the work of Beck 
and Brooks (1967) and Sutterer and Beck 
(1970) who found semantic differential rating 
Changes for neutral stimuli preceding and 
following a noxious stimulus. Generally 
Speaking, the signal which is antecedent to 
an unpleasant event changes its affective 
quality toward unpleasantness, whereas the 
Opposite is true for a stimulus following 
Such an event (e.g, the change is toward 
Pleasantness). 
Р Shift in meaning of a symbolic stimulus 
implying a contingent event can be seen to 
Change its arousal properties differently as 
a result of general exposure to the event, or 
graded experiences with the event (Grings & 
Zeiner, 1965). This type of phenomenon can 
be demonstrated to occur with symbolic 
Stimuli not themselves reinforced but in- 
cluded in a group of reinforced stimuli 
(Grings, Carlin, & Appley, 1962). In other 
cases the intrinsic meaning of symbolic stim- 
uli may determine the subject’s perception of 
the stimulus sequence (e.g., Grant, 1972). 

There is also support for the assumption 
that the perceptual properties of a noxious 
Stimulus (e.g., the UCS) will change as a 
Tesult of paired stimulation with a signal 
Stimulus, For example, it has been generally 
reported that an electrodermal response toa 
Noxious stimulus, like an electric shock, is 
less in magnitude when that stimulus ipte 
ceded by a signal than it is when it is pre- 


sented alone. The phenomenon, termed UCR 
diminution (e.g., Kimmel, 1966) is compli- 
cated by difficulties arising from peripheral 
response interference (e.g., Grings & Schell, 
1969) and orienting behavior (Badia & 
Defran, 1970). 

In at least one study the rated intensity 
of the UCS was found to change in the same 
direction as the autonomic response (ie., as 
the electrodermal response became less, the 
stimulus was rated less intense) (Schell & 
Grings, 1971). Other studies show different 
results (e.g., Kimmel, 1967; Furedy & Doob, 
1971). One basis for predicting enhancement 
rather than diminution has been suggested in 
the situation where signal stimuli are followed 
by a UCS different from that with which 
they had previously been associated (Grings, 
1960). 

A theory for explaining both directions of 
response change (diminution and enhance- 
ment) in terms of perceptual changes has 
been proposed by Lykken (1962, 1968). His 
central concept is termed “preception” and 
involves a hypothetical mechanism for ampli- 
fying or attenuating the subjective intensity 
of predictable stimuli, Other evidence for the 
effect of stimulus pairings upon perception 
of the signaled stimulus comes from studies 
of preference for signaled and unsignaled 
shock (e.g, Badia, McBane, Suter, & Lewis, 
1966); and judgments of stimulus intensity 
under different circumstances of warning 
(e.g., Epstein & Clark, 1970). 

A central focus of research on awareness 
variables was upon perception of relations 
among stimuli or of contingencies. One im- 
mediate question concerns the manner in 
which the relation is learned, that is, how 
the subject becomes aware of a contingency. 
Most research to date falls into one of two 
groups. In one, learning is by some self- 
discovery method and in the other learning 
is by some form of verbal guidance. Common 
observation indicates that knowledge of rela- 
tions or contingencies alone may be a weak 
modifier of behavior. Additional research 
concepts are needed which relate to the 
strength of belief about the knowledge of 
the contingency. One way such a variable 
can be studied is through reports of certainty 
by the subject. Other ways may be developed 
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Írom considerations of the properties of the 
contingency relations, 

One important feature of the relation in 
the case of simple sequence association is the 
information that is contained in the sequence. 
Two major kinds of information occur, One 
concerns the qualities or characteristics of 
the event to follow and the other arises from 
the certainty elements of the relation. In the 
last category at least three forms of certainty 
are available for research: event certainty, 
time certainty, and quality certainty, Exam- 
ples of event certainty can be developed 
through manipulation of the proportion of 
times that the second event follows the first 
(a concept somewhat analogous to percentage 
of reinforcement, except that it might include 
pattern as well as rate of occurrence of pairs, 
etc.). Time certainty can be defined by the 
difference between a constant interstimulus 
interval and a variable interstimulus interval, 
for the arrival of the unpleasant signaled 
event will be more uncertain with variable 
interstimulus interval than with constant 
interstimulus interval, Quality uncertainty 
relates to the extent to which the qualitative 


ing constant intensity and variable intensity 
second stimuli. Recent experiments to evalu- 
ate these informational variables in the 
present context manipulated event certainty 
(eg., Epstein & Roupenian, 1970; Grings & 
Sukoneck, 1971), time certainty (e.g., Grings 
& Schell, 1971), and quality certainty 
(e.g., Epstein & Clarke, 1970: Ohman, 1971), 

In brief, it is concluded that cognitive 
determination of autonomic conditioning re- 
sults from the central fact that the CS con- 
lains information properties which can be 
conceptualized in terms of knowledge of 
stimulus relations and knowledge of stimulus 
properties, Experimental manipulation can 
be achieved through Concepts of certainty, 
subjective probability, and information trans- 
mission, There is an implied role played by 
the signal. In aversive situations this is 
Postulated as controlling (modulating) the 
arousal characteristics of the Strong (un. 
conditioned) stimulus, Finally, there is an 
implication that the response to the uncondi- 


tioned stimulus is dependent to a degree upon 
the information provided by the signal. 
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COMMENT ON “LINGUISTIC DEFICIENCY AND THINKING: 
2d RESEARCH WITH DEAF SUBJECTS 1964-1969” * 


HARRY BORNSTEIN лхо HOWARD L. ROY ? 
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A paper by Furth reviewed 39 studies in which the cognitive performances 
of deaf children were compared with those of hearing children, With some 
exceptions, deaf children performed as well as hearing children on the tasks 
set for them. Because Furth believes that deaf subjects are linguistically 


2 deficient, he concluded that the thinking processes oi deaf children are similar 
to those of hearing children and therefore must be explained without recourse 
{о verbal processes. He offers as evidence for language deficiency one stan- 
dardization study which shows that the mean reading achievement oi deaf 
students on a nationally standardized achievement test falls below fourth- 
grade equivalence. This paper argues (а) that the achievement test was 
normative referenced and does not provide evidence for an inability to handle 

= English sentences and (b) that there exists still other evidence of deaf stu- 
dent mean achievement at fourth- and fifth-grade equivalence. We contend 
that deaf subjects cannot be regarded as language deficient without explicit 
demonstration to that effect. Failing this, the reviewed studies cannot be 
said to contribute evidence for or against the hypothesis that language is not 
related to or required for cognitive development. 

In a recent issue Furth (1971) reviewed a directed at that section, but it should be 
number of studies in which the cognitive per- noted that we are not necessarily opposed to 
formances of deaf children were compared the point of view that cognitive processes or 

22 with those of hearing children, The tasks in- reasoning ability are not dependent upon 
volved in these studies were categorized as: language. We do object, however, to Furth's 
rule learning, logical symbols, Piaget-type contention that deaf subjects are lacking in 
tasks, memory, and perception. With some the language skills necessary or appropriate 
exceptions, deaf children performed as well as for the cognitive tasks in the reviewed studies. 
hearing children on the tasks set for them, In our opinion, the data obtained from deaf 
On this basis, Furth concluded “that think- subjects in these studies do not provide 

Ф ing processes of deaf children are similar to adequate tests of his thesis. 


those of hearing children and therefore must 
be explained without recourse to verbal 
Processes [p. 70].” 

The fundamental rationale for this conclu- 
sion rests on the assumption by Furth that 
deaf subjects are linguistically deficient. 
Thus, any cognitive task on which deaf per- 
Sons do as well or better than hearing sub- 
jects can be viewed as a performance which 
does not require language. He devotes a con- 
Siderable section of his article to defining and 
Supporting this conclusion. Our comments are 


"The authors are grateful to James M. Pickett, 
Horace N. Reynolds, William C. Stokoe, Jr. and 
Augustine Gentile for their suggestions and comments 
pn earlier drafts of this paper. 


ollege, Kendall Green, Washington, D. С. 
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One of the first studies reviewed by Furth, 
a study by Blank and Bridges (1966), was 
concerned with learning in situations involv- 
ing absence (negation or numbers). The rele- 
vant cue was the presence or absence of 
stimulation (light or vibration). The deaf 
children (3-6 years of age) did as well as 
the hearing children. The authors concluded 
that deaf children, although deficient in 
general language, succeeded “not because 
they had a non-language concept, but rather 
because they possessed the appropriate sym- 
bol (e.g, word, gesture) to designate the 
concept [р. 59]." Interestingly enough, 
Furth does not quarrel with this conclusion. 
It seems, therefore, that these deaf subjects 


Bo 4088 for reprints es has o eR are linguistically deficient only if one equates 
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TABLE 1 


Mean GRADE EQUIVALENTS ny TEST BATTERY LEVEL FOR THE PARAGRAPH MEANING SUBTEST 


OF THE STANFORD ACHIEVEMENT TEST FOR HEARING IMPAIRED STUD) ENTS 
WITH BETTER EAR Loss OF 60 AND ABOVE 
Intermediate I level | Intermediate H level Advanced level Weighted mean of means 
ме | \ Mean grade ; Grade 
= | ы equivale: equivalent " equivalents 
2%) 159 | зз | | 1% | 
13, 222 | 3.39 | 166 | 4.08 | 388 | 
14 174 3.20 | 179 | +43 353 
15 191 3.24 192 4.31 | MS | 
16 151 | 3.26 | 200 | 4.30 | 609 
17 | | 148 +10 | 30 | 
18 | 235 | 
19 | | 159 
| - n NAM | EN ! кари 
Nole. Adapted from A. Gentile & 


United States: Spring 1969, (Series D. 


tence structure, In fact, however, the deaf 
students in this study were not linguistically 
deficient in terms of the tasks required. They 
knew the word or gesture needed to perform 
the cognitive tasks presented to them. We 
think it is incumbent upon Furth to demon- 
strate for each such Study that a word, 
phrase, or series of phrases is not sufficient 
language to perform the task required. Until 
this has been established, no conclusion about 
language deficiency relative to cognitive func- 
tion can be drawn from using deaf students 
as subjects. 

There is another matter which gives us 
Some concern. It is Furth's definition. of 
language and his evidence in support of it. 
He states that *.., comprehension of 
fourth-grade reading as measured by present 
standardized tests can be suggested as an 
appropriate criterion [of language compe- 
tence] .. . [p. 69]." He then offers results 
from a single standardization study based 
upon a sample of 5.284 deaf pupils between 
the ages of 10} and 16} conducted in 1963. 
The test was the Metropolitan Elementary 
Reading Test. The mean grade equivalent of 
the oldest group was 3.6 and this represented 
very little change from the performance of 
the younger groups. Furth considers these 
findings as conclusive evidence of language 
deficiency, Two points can be noted: First, 


the Metropolitan (as well as the Stanford 


Achievement Test which will be cited later) 


ment lest performance of hearing impaired students in the 
Gallaudet College, Office of Demographic Studies, 1969, 


is a normative and not a criterion referenced 
test. Test scores expressed as grade equiva- 
lences represent a performance which 15 
"equivalent" to some modal score achieved 
by children at a given grade—in this case 3.6- 
It is very difficult to describe what this level 
of skill really represents, but if the reader 
doubts that materials included in a third- 
grade reader are made up of sentences, he 
need merely glance at such a book. Further, 
test constructors normally do not interpret a 
grade equivalence score of 3.6 to mean that 
the group cannot read some fourth-, fifth-, 
or sixth-grade material. Normative referenced 
tests are simply not designed to furnish such 
conclusions, Interested readers may refer. to 
Angoff (1971) for a current scholarly review 
of permissible interpretations of age and 
grade equivalence scores, The second point 15 
that more recent evidence on the reading 
abilities of deaf students differed from the 
earlier Metropolitan Test findings, Gentile 
and DiFrancesca (1969) conducted the first 
annual survey of Stanford Achievement Test 
performance of hearing impaired students in 
the United States. Test results for 11,544 
pupils from ages 7 to 19 were reported. 
Table 1 is based on data from that report 
and shows mean grade equivalents by age 
for the Paragraph Meaning subtest of the 
Stanford Achievement Test. This subtest 15 
included at several different test battery levels 
and is administered by the school at а level 
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thought appropriate for any given child. The 
data in Table 1 deal only with those students 
With hearing losses of 60 decibels or more in 
the better ear. The data do not show an in- 
Spiring reading performance. However, two- 
thirds of the age means are at the fourth and 
higher grade levels. Fifth grade appears to 
be the mean achievement ceiling. We are not 
prepared to say that these data are more 
valid than those secured earlier with another 
test, but surely the evidence does not point 
uneqivocally to reading achievement below 
fourth-grade equivalence, whatever that may 
mean in absolute terms. 

It seems to us that Furth confuses inade- 
quate performance or limited competence in 
handling the grammatical structure of English 
with deficiency or inability to form meaning- 
ful English sentences. We believe that it is 
Scientifically more useful and more consistent 
With current linguistic theory to view the 
Seneral] structure of English as made up of 
a very complex set of phrase structure and 
transformational rules. Deaf persons differ 
from hearing persons in their ability to use 
these rules. There is a considerable body of 
direct information available on which rules 
and in what settings the deaf have difficulty, 

here is, however, no direct empirical infor- 
Mation available on a claimed inability of 
deaf persons to handle the syntactical and 
Srammatica] structure of the English lan- 
guage, possibly because it is a global concept 
Which has not been operationally defined. 

Any discussion of the language of deaf 
Persons is further complicated by the fact 
that Many or most of them have a second 
@nguage system available to them, that is, 


the American Sign Language. Furth (1971) 
States: 


е Conventional gestural system sign language in 
hoo onê deaf students is only available in child- 
= to those deaf children who are born of deaf 
‚ Т°п{з—д very small percentage of the deaf popula- 


ti m 
qe Hence it is literally true that most deaf chil- 
li Cn grow up during the early formative period of 
ife w 


ithout any systematized conventional language. 
n Significant fact is that in spite of not having 
deaf Pro) ided with a conventional symbol structure, 
к Children construct their own symbols as they 
d for the development of thinking. On this 
Act rests the present writer's interpretation 


Dr. : 
10 reviewed studies Гр. 691. 


TABLE 2 


BACKGROUND AND SIGN LANGUAGE FLUENCY 
INFORMATION ON 1968-1969 GALLAUDET 
ENTERING STUDENTS (N = 220) 


Background Information n 96 
Sign language ability 
Fluent 114 52 
Fairly well 71 32 
A little 18 8 
Notat all 14 6 
No response | 3 2 
Kind of school attended 
Public hearing school 58 26 
Day school for deaf 35 16 
Residential school for deaf 135 61 
No response 8 4 
Total 236 107 
Age sign language learned 
0-5 years 51 23 
6-10 79 36 
11-15 38 17 
16-20 28 13 
Over 20 3 1 
No response 21 10 
From whom learned 
Parents 29 13 
Staff 81 37 
Friends 139 63 
Other 13 6 
No response 9 4 
Total 271 133" 


Note. Reprinted from Н. Bornstein & B. pell. N 
signs for instructional purposes. (Final Report, OE Project 
No. 6—1924) Washington, D. C.: United States Office of 
Education, 1969. к 

a Some students attended more than one kind of school or 
learned the sign language from more than one kind of person. 


There are several observations which can 
be made about this statement. First, no 
formal evidence is offered to support it. 
Second, while no age numbers are provided 
for “the early formative period of life," the 
really crucial consideration is the age of the 
students involved in the reviewed studies. 
Put in another way, is it not possible that 
the students studied were provided with a 
usable conventional symbol structure by 
others? An analysis of the upper and lower 
extremes of the range of ages of the children 
serving as subjects for the reviewed studies 
indicates that only the youngest children in 
eight studies were below 6 years of age, the 
normal beginning age for school attendance. 
The very youngest children in the remaining 
31 studies are school-age students. There is at 


HARRY BORNSTEIN 


least one survey by Bornstein and Kannapell 
(1969) which has attempted to determine 
when and from whom deaf students learned 
the sign language. It is summarized in 
Table 2. The students were the entering 
1968-1969 class at Gallaudet College, not by 
any means a representative sample of deaf 
pupils. However, the data in Table 2 do 
reveal some interesting information about how 
and when the sign language is learned in the 
United States. Most of the group report that 
they learned the sign language after reach- 
ing the school setting from friends and 
School staff, 

Since school age corresponds to the age of 
most of the subjects participating in the 
Studies reviewed by Furth, it is very pos- 
sible that this linguistic system was indeed 
available to some or many of the subjects. 
Just how many would know the sign language 
is difficult to determine, but it is certainly 
Furth's burden to demonstrate that the sub- 
jects did not have this linguistic system at 
their disposal if he concludes that language 
could not have mediated the cognitive tasks 
involved in the studies. Lest the reader un- 
familiar with the American Sign Language 
assume that this is a very primitive language 
system, it can be stated that the American 
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deaf have a vocabulary of from 3,000 to 
5,000 signs at their disposal and they use 
signs in a syntax peculiar to that language 
(Stokoe, Casterline, & Croneberg, 1965). 
The description of the studies in Furth’s 
article leads us to believe that many cogni- 
tive performances described might well be 
expressed or mediated in the sign language. 


REFERENCES 


Ancorr, W. Н. Scales, norms and equivalent scores. 
In R. L. Thorndike (Ed.), Educational Measure- 
ment. (2nd ed.) Washington, D. C.: American 
Council on Education, 1971. 

Вглхк, M., & Brinces, W. Н. Conceptual cross- 
modal transfer in hearing and deaf children- 
Journal of Child Experimental Psychology, 1969, 
7, 407-408. 

BORNSTEIN, Н, & KANNAPELL, B. New signs 101 
instructional purposes. ERIC, 1969, No. ED-042- 
293. 

FunrH, H. Linguistic deficiency and thinking: m 
search with deaf subjects 1964-69. Psychologica 
Bulletin, 1971, 76, 58-72. " 

GENTILE, A., & Dı Francesca, S. Academic achieve- 
ment test performance of hearing impaired stit- 
dents in United States: Spring 1969. (Series D, 
No. 1) Washington, D. C.: Gallaudet College 
Office of Demographic Studies, 1969. 

STOKOE, W., CASTERLINE, D., & CRONEBERG, C. 
Dictionary American Sign Language. Washington: 
D. C.: Gallaudet College Press, 1965. 


(Received January 14, 1972) 


EE تون‎ ۲ 


| 
| 


Psychological Bulletin 
1973, Vol. 79, No. 3, 215-216 


FURTHER THOUGHTS ON THINKING AND LANGUAGE 


HANS G. FURTH ! 
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Deaf children are characterized as deficient, that is, limited, in their competence 
in handling linguistic rules, not necessarily in knowing simple words or signs. 
It is suggested that Bornstein and Roy's theoretical preconditions for studying 
the language-thinking hypothesis are experimentally untestable. An adequate 
interpretation of their tables on reading and language confirms rather than 
weakens the claim of limited linguistic competence in deaf children. 


Bornstein and Roy (1973) claim that they 
do not necessarily oppose my conclusions 
(Furth, 1971) but they consider my descrip- 
tion of linguistic deficiency inadequate. On a 
terminological level, they would prefer the 
term “limited competence.” I chose the term 
deficiency” in analogy with mental defi- 
ciency which implies some limitation in 
mental competence, not a total lack. 

On the linguistic status of deaf persons 
the authors unfortunately give only scant 
Clarifications so that this important topic 
remains to be studied. In the meantime any- 
One can observe—and this is not disputed by 
Bornstein and Roy—that the great majority 
Of deaf children have a strikingly limited 
knowledge of any linguistic system. More- 
Over, the authors accept (and in part docu- 
Ment) my claim that most deaf children 
learn sign language only after entering 
School. I have explicitly stated that I focus 
Оп the knowing or not knowing of a linguistic 
System and not of single words. I have indeed 
equated linguistic deficiency with a “lim- 
ited competence in handling a grammatical 
Structure,” 

One could hypothesize that for intellectual 
development the first 6 years are crucially 
important and the absence of a conventional 
Symbol system such as English or sign lan- 
Suage during these years could have harmful 
effects on the developing intelligence. It is of 
Scientific interest to demonstrate that the 
absence of a svstematized conventional lan- 
праве during the early formative years of 
Е has few deleterious effects on intellectual 
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development. This is what I have tried to 
show in Furth (1971) and have further 
documented, including social and personality 
aspects, in Furth (1973). 

The main theoretical objection of the au- 
thors to my position is contained in such 
phrases as, "language skills necessary or ap- 
propriate for the cognitive task" or “language 
skills in English or sign language for any 
given cognitive task." I have never claimed 
and do not need to claim in support of my 
thesis that deaf children do not know the 
potentially correct word, gesture, or sign for 
enunciating the relevant principle of some 
tasks (eg. "no," “same,” “blue,” “big,” 
“three,” etc.). The authors feel that “it is 
incumbent upon Furth to demonstrate, for 
each study, that a word, phrase, or series of 
phrases is not sufficient language to perform 
the task required [p. 212]." This statement 
may appear plausible and meaingful to some- 
one who holds the theory that a linguistic 
medium is a prerequisite for intelligent per- 
formance. However, what experimental design 
could conceivably demonstrate that certain 
words or phrases are not sufficient for a suc- 
cessfully performed task? It is known from 
animal experiments that a simple- and double- 
alternation task can be performed without 
language. А 4-year-old child may know the 
words ‘left, left, right, right" and nevertheless 
fail on double alternation. But how could one 
go about showing that for an 8-year-old child 
who succeeds on double alternation, knowing 
the phrase “left, left, right, right" is not 
sufficient? I wonder how the authors would 
interpret the performance I reported in 
Study 20 on probability, or in Studies 21 
and 22 on visual transformation. Apart from 
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the question of sufficiency, what language 
skills could be relevantly investigated? 

As to Table 1 in the Bornstein and Roy 
paper, which is adduced as evidence of fifth- 
grade achievement in reading for deaf young- 
sters, it can be seen readily that these scores 
are primarily a function of the scoring floor 
of the different tests. For the same tests no 
improvement at all takes place over an age 
range of 5 years; the only improvement 
occurs across different tests with scores that 
are rather close to chance levels for the 
respective tests (2.8, 3.6, 4.8). 


Table 2 is based on a highly select sample. 
But it does confirm what I have reported. 
Only a small minority (13%) of the 220 
students learned sign language írom their 
parents. The additional finding that only 
5395 of this select sample is fluent in sign 
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language is particularly illuminating as it 
indicates that even after leaving high school 
a great many deaf youngsters are at home 
neither in English nor in sign language. They 
are linguistically deficient in the precise sense 
of the word, in contrast to hearing persons 
who are all fluent in at least one language 
irom 4 years of age onwards. 
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A PSYCHOLOGICAL PERSPECTIVE ^ 


ROBERT HOGAN ? 


Johns Hopkins University 


1 This paper sugge: 
of moral character can be d 


moral knowledge, socialization, empathy, 
moral judgment. These dimensions arise 


formulating and rule-following animal, 
a result of biological and cultural evolution, which regulates and 
Each concept is given a theoretical and an opera- 


odology, 
moderates social conduct. 


tional definition, its developmental anteced 
vided concerning its usefulness as a means for underst; 


pro 


sts that moral conduct can be explained, and the development 
lescribed, in terms of five concepts or dimensions: 


autonomy, and a dimension of 
from a conception of man as a rule- 
and a view of morality as a meth- 


ents are discussed, and evidence is 
anding moral conduct. 


Finally, the model as a whole is evaluated. 


The subject of moral conduct contains some 
Pi the most intriguing problems and para- 
Oxes in the social sciences. These problems 
(e&., why а man would follow the path of 
Em resistance; why saints seem. obsessed 
thinks sense of sin) have preoccupied social 
D ES since Plato, including, more recently, 
urkheim, Weber, Freud, Piaget, William 
m and George Herbert Mead. In the 
B stages of American psychology there 
E great interest in the "psychology of 
E conduct”: however, the Character Edu- 
"s ion Inquiry (Hartshorne & May, 1930) 
€med to demonstrate that most such be- 
ои is situation specific, and the subject fell 
@ Subsequent neglect. In a series of papers 
da A in 1958, Kohlberg revived interest 
Ex psychology, and his research on 
—_ Judgment exemplified the relevance, 
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complexity, and psychological richness tradi- 
tionally associated with the topic. This paper 
describes a new psychological perspective on 
moral conduct, and presents some empirical 
findings which follow from this perspective. 
The paper begins by outlining the assump- 
tions and definitions required for the subse- 
quent discussion. The burden of the paper, 
however, is to argue that development of 
moral character can be described, and moral 
conduct can be explained in terms of five con- 
cepts or dimensions: moral knowledge, so- 
cialization, empathy, autonomy, and a dimen- 
sion of moral judgment. Each dimension is 
defined both conceptually and operationally, 
its developmental antecedents are described, 
and evidence is presented concerning its em- 
pirical utility for understanding moral con- 
duct and character development. Finally, the 
model taken as a whole is discussed. 


ASSUMPTIONS 


An earlier paper (Hogan & Henley, 1970) 
suggested that as an alternative to explaining 
social behavior in terms of needs, drives, 
traits, or mechanisms of coping and defense, 
it might be useful to consider man as a rule- 
formfülating and rule-following animal. The 
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argument derives from a more comprehensive 
theory of social behavior based on psychologi- 
cal (as opposed to sociological) role theory, 
and an image of man as a game-playing ani- 
mal (cf. Goffman, 1959; Huizinga, 1955; 
Peters, 1958; Wright, 1971). While the de- 
tails of the argument are not important here, 
three of its assumptions are (the first two as- 
sumptions are substantive, the third follows 
eir methodological consequence). Tt is 
лей first that all purposive social behav- 

ccurs within matrices of overlapping hu- 
man rule systems (e.g., grammars, legal codes, 
rules of courtesy) which have varying degrees 
of specificity. (Tt follows, somewhat tauto- 
logically, that behavior not guided by rules is 
nonpurposive or “random.”) Second, associ- 
ated with every human rule system is an ethic, 
variously called sportsmanship, fairness, jus- 
tice, equity, or the spirit of the game. 
Huizinga (1950), for example, remarks that, 
“In every... community that is healthy . . . 
there will blossom an ideal of chivalry and 
chivalrous conduct, whether it be in Greece, 
Arabia, Japan, or medieval Christendom [p. 
64].” This ethic, usually known and acknowl- 
edged by seasoned participants of a given 
rule system, is a spontaneous natural emer- 
gent, a product of role-taking experience and 
cognitive development, Tts function is to pro- 
vide a perspective from which the equity of 


conventional rules may be judged, Third. the 
broad outlines and gen 


behavior can be expl 
Properties of various 
Individual differences 


evolutionary pressures, 


Standing of moral behavi 
knowledge of man’s bi i 
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four are particularly relevant to the model 
presented below. First, as Waddington (1967) 
argues, the dynamics of human culture neces- 
sarily presuppose “the role of хонани 
ceptor.” Consequently, a disposition to compi 
with authority is considered to be part 0 
man’s innate constitution, and this disposition 
is seen most vividly in children. Second, реи 
win (1872) suggested in The Expression E 
the Emotions in Man and Animals that Dd 
is naturally disposed toward altruistic ee 
of group loyalty. The biological пе 
altruistic tendencies has been further nd 
cussed more recently by Campbell (1965) helt 
Hamilton (1971); the implication of те 
arguments is that man is naturally sens! a 
to the expectations and demands of his рее! 


lation: 
;be 
em- 


Serve as innate mechanisms for ig t 
The four biological concepts just - 
have obvious relevance to mpi d 
pathy, autonomy, and internal stand: gycho 
the evaluation of behavior, the зор теп 
logical processes underlying the deve 
of moral character and conduct. з consider 
Many contemporary psychologists t from 
moral action to be in some way i i E 
ordinary social conduct. Thus Reuse. feel 
marks that in the moral domain, perso! them- 
that rules are constraining in and e 
selves, whereas in the social (nonmora sentio” 
persons recognize the content of CON serv? 
as artificial. Wright (1971) similarly 0 nich 
"Both the conceptual framework me whi 
they are formulated and the experienc! 


) area: 


ya 
ана А ССГ” of йб м 
3L. Kohlberg. The development of pap ори? 
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goes with them mark out moral rules as dis- 
Unctively different from, for example, the 
rules of tennis, grammar, or etiquette |р. 14].” 
In Contrast, the present paper makes no sharp 
distinction between social and moral rules, 
Rather, it assumes that an evaluative tendency 
1s an integral part of social conduct, and that 
morality has a social job to do; that is, the 
function of morality is to regulate and mod- 
erate human affairs. The following quotations 
express the flavor of this assumption: 


Moral conduct is essentially social conduct, and there 
Could be no serious objection to the use of the two 
Expressions as synonymous; but it is more in con- 
formity with common usage to restrict the term 
moral” to the higher forms of social conduct of 


oo man alone is capable [McDougall, 1908, p. 
о! . 


Manners in their best sense . . . cannot be distin- 
linguished from morals, and are nothing but the 
xpression of the courtesy of an individual toward 
People about him [Mead, 1934, p. 263]. 


These remarks serve as a background for 
the definitions of morality, moral behavior, 
and moral character which follow. 


DEFINITIONS 
Morality 


According to Hart (1962), the basis of any 
Morality is a system of rules of conduct which 
eines a network of reciprocal rights and ob- 
‘gations and which prohibits at least gross 
Acts of malevolence. Moralities, then, are sys- 
‘Ms of rules that are external to people, de- 
"IBhed to guide social or interpersonal be- 
шк, and which may to some degree be 
Odified and spelled out. This definition does 
t require a morality to be entirely com- 
а que With or to further unequivocally the 
; are of individuals subject to it, nor does 
Tequire that a particular morality be com- 
*tely equitable when reviewed from a per- 
е Outside the system in which it ap- 
Пау In fact, the demands of justice or fair 
tun in a given situation may occasionally 
Contrary to the prevailing moral code. 


Moral Behavior 


5] 


a behavior consists of actions carried 
Biven regard to the rules that apply in à 
Social context (see also, Wright, 1971, 
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p. 15). Obeying, disobeying, enforcing, sub- 
verting, learning, and using rules are all types 
of moral conduct or misconduct requiring ex- 
planation; misconduct is thus considered a 
subclass of moral behavior. For pragmatic and 
statistical reasons, one form of moral behavior 
(obedience or conformance) is usually salient. 
The most impressive single feature of every- 
day life is the degree to which people obey 
the norms of their culture. Even such self- 
consciously debauched men as Aliester Crow- 
ley, the Marquis de Sade, and Ceseare Borgia 
committed only a statistically insignificant 
number of deviant acts each day, and these 
were of a highly idiosyncratic and specialized 
nature, confined to limited subsystems of be- 
havior. In the final analysis, moral behavior 
typically comes down to either following or 
disregarding a social rule of some sort; con- 
sequently a major problem for the psycholog, 
of moral conduct is to account for social com- 
pliance or noncompliance. 


Moral Character 


Freud pointed out that neurotic symptoms 
could be understood only by considering them 
within the context of the character structure 
of the person involved. Thus cigarette smoking 
in an anxiety neurotic might represent a means 
for gaining social approval, whereas it would 
be self-destructive behavior in a depressive. 
For Freud, then, neurotic character rather 


than neurotic symptoms became the primary 
subject matter. Similarly, within the context 
of moral psychology, delinquency, civil dis- 
obedience, and altruistic actions can be under- 
stood only by taking into account the char- 
acter structure of the person involved, The 
concept of character closely resembles the 
notion of personality. Character, however, is 
usually defined in terms of those dispositions 
and traits that are subject to moral evalua- 
tion within a society. That is, character is 
defined not by what a person does, but by 
his reasons for doing it, by the recurring mo- 
tives and dispositions that give stability and 
coherence to his social conduct. A second 
major problem for moral psychology, there- 
fore, is to describe the dimensions along which 
character structure seems to vary. 
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THE MODEL 


There are five recurring themes in philos- 
ophy and psychology which, when taken to- 
getner, explain a considerable range of moral 
behavior and define certain important param- 
eters of character development. These ele- 
ments are at least conceptually independent 
and seem to characterize, in a formal way, 
how people differ in their use of rules. That 
is, these concepts are abstract dimensions of 
individual differences in nomotic (rule-gov- 
erned) behavior, and as such, they should 
help to explain moral conduct in any socio- 
cultural context. The first dimension is sim- 
ply moral knowledge. The second corresponds 
to socialization in the Freudian sense of super- 
ego development, and is denoted by having an 
internal as opposed to an external viewpoint 
concerning the rules of one’s society. The 
third dimension involves role taking and the 
recognition of the rights of others; it is the 
disposition to adopt the “moral point of 
view.” The fourth dimension concerns the de- 
gree to which a person considers rules to be 
useful for regulating human affairs. The final 
dimension is defined by the degree of au- 
tonomy a person displays in his moral con- 
duct, 

As an aid to understanding the interrela- 
tionships among these five concepts, consider 
the following example as a paradigm case. A 
college student of modest talent, while taking 
a difficult examination whose outcome he be- 
lieves will affect his future career, observes 
the answer sheet of an unusually gifted stu- 
dent lying in plain view in front of him. How- 
ever, he deliberately ignores the answer sheet, 
thereby losing an opportunity to improve his 
score. How can we explain his action? 


MORAL KNOWLEDGE 
Explanatory Role of Moral Knowledge 


_ The first answer to our paradigm question 
is that the student knows a rule that pro- 
hibits copying answers from another person’s 
test. This is important because people can 
follow only the rules that they know. Moral 
knowledge means knowledge of moral rules 
and a proper test of moral knowledge assesses 
the number and kinds of rules a person can 
State, or the variety of rules that he can cor- 
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rectly use, It does not determine how he feels 
about these rules, how he reasons with them, 
or how he evaluates them. 

Because moral knowledge implies that 
moral learning has occurred, the next ques- 
tion concerns the kinds of things that are 
learned during the moral education of the 
child. Children seem to learn three distinct 
types of rules: (a) specific, primarily nega- 
tive injunctions related to particular acts, 
such as “Don’t get dirty,” "don't hurt the 
cat;” (b) moral principles—general rules к 
norms oí conduct, corresponding to Piaget 
notion of a schema, that permit flexibility e 
social action and enable the child to Mes 
between conflicting injunctions (8.8: the 90); 
cept of reciprocity—cf. Gouldner, 196 (s. 
(c) comparison rules—cognitive strate o E 
which the person is able to compare what pA 
doing with an ideal norm of behavior. ЫЙ 
first two rules are substantive while the © 
is procedural; that is, comparison le nile 
operate primarily on injunctions ern 
moral principles (b) mediate between t ise Í 

Moral knowledge is important becaus" ү 
provides the potential for self-control. 41029 
is, by learning a set of specific injunc one 
moral principles, and comparisons тезу d 
acquires the cognitive foundation wed (but 
criticism, which in turn is a necessary’ jop- 
not sufficient) condition for the later de mpl 
ment of self-control, Assume, for CX? and, 
that a tennis player is losing а oan the 
after mentally comparing his oen е} 
various rules of tennis form, he deci 3 or no! 
not volleying deeply enough. Whethe in 26 
he controls his subsequent strokes бп dg 
cordance with this self-critical perceP rus, 
pends largely on his interest in ign the 
knowledge of the appropriate rules а" шр 
capacity for self-criticism afford the trol. 
bility but not the assurance of selí-co? 


Measurement of Moral Knowledge ent 


Although considerable effort has BE or 
studying the behavioral implications 66): 
knowledge (cf. Pittel & Mendelsob ne^ 
this research has produced essent! M an 
gible results. For example, the Harts qur 
May (1930) studies included several ^ ей! 
of moral knowledge, none of which Patioa 
resistance to temptation in check * 
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Nor was there a significant relationship be- 
tween a child's evaluation of the wrongness 
of cheating during interviews and his be- 
havior in experimental situations. Similarly, 
Gordon, Short, Cartwright & Strodtbeck, 
(1963) found no differences in the degree to 
Which six groups of teenage boys (black and 
white; delinquent and nondelinquent; lower 
class, middle class) understood the applicabil- 
ity of middle-class norms. Moreover, as Maller 
пш) Observed, tests of moral knowledge 
Orrelate about as highly with IQ tests as 
is. ken correlate with themselves. Moral 
ieee seems to be primarily related to 
3 Igence, background, and desire to make 
Es impression (cf. Kohlberg, 1964). Be- 
а we are normally justified in assuming 
m people know the rules, it seems naive to 
» те a strong connection between knowl- 
Len of the rules and willingness to abide by 

€m. As Malinowski (1932) pointed out in 
" "e and Custom in Savage Society, there 
E €viants in every society; however, no one 
ч doubts what is held to be right and 
Wi 8. The problem lies in learning to live 
ith the “right” of one's society. 


SOCIALIZATION 
The Ex. "S 
*e Explanatory Role of Socialization 


ere second answer to the paradigm ques- 
Бо that our student may consider the rule 
t test cheating to be personally binding; 
tae 1s, he has “internalized” the rules of test 
Hagen The term socialization has few dis- 
Bare conceptual properties remaining after 
м Eos uncritical use (cf. Clausen, 1968). 
Derso пе purposes of this paper, however, a 
egree may be considered socialized to the 
oan he regards the rules, values, and 
ator tions of his society as personally man- 
eels Conversely, to the extent that one 
is б E from the rules and norms of 
аго ial group, he tends to Бе unsocialized. 
ang а (1962) distinction between an internal 
refle n external attitude toward social rules 
a cts the phenomenology of socialization: 
һе 


rules of conduct, 
for many closely 
for it is 
her merely 


ы Social group has certain 
relat affords an opportunity 
Possit Vet different kinds of assertions; 
= © to be concerned with the rules, eit 
Observer who does not himself accept them. 


or as a member of the group which accepts and 
uses them as guides to conduct. We may call these, 
respectively, the “external” and the "internal" points 
of view [p. 86]. 


Hence a society with law contains those who look 
upon its rulings from the internal point of view as 
accepted standards of behavior, and not merely as 
reliable predictions of what will befall them, at the 
hands of officials, if they disobey. But it also com- 
prises those upon whom, either because they are 
malefactors or mere helpless victims of the system, 
these legal standards have to be imposed by force 
or threat of force; they are concerned with the rules 
merely as a source of possible punishment [p. 197]. 


Developmental Antecedents oj Socialization 


Most social psychologists assume some sort 
of natural antagonism between the individual 
and society. Freud, for example, believed that 
a child becomes able to live in society only 
after his natural but implacably anarchic tend- 
encies are permanently repressed. One might 
equally hypothesize, however, that children 
are not anarchical but social by nature, that 
they enter the world programmed to be obedi- 
ent, and that warmth and nurturance are es- 
sential in eliciting these tendencies. For ex- 
ample, in a sample of 11-month-old infants, 
Stayton, Hogan, and Ainsworth (1971) found 
that, of four key parent-child variables, only 
maternal warmth predicted obedience to com- 
mands (7 = .74). Thus, rather than ask what 
must be done to the child to fit him into so- 
ciety, it may be more important to ask what 
must be done in order to drive him out. 

Considerable research has been directed 
toward discovering the developmental ante- 
cedents of socialization, and the resulting lit- 
erature is too extensive to review here. The 
available evidence, however, strongly suggests 
that while warmth is important, “love is not 
enough,” that as children become more mo- 
bile, control factors play a correspondingly 
larger role in socialization. Thus warm, nur- 
turant, amd consistently restrictive or “au- 
thoritative" parents produce the most so- 
cialized children (cf. Bandura & Walters, 
1959; Baumrind, 1971; Becker, 1964; Bron- 
fenbrenner, 1970). The socialization process is 
largely completed by the time a child enters 
school, and results in what Piaget (1964) 
called “moral realism,” that is, an authori- 


tarian ethic of conformity. 
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Measurement of Socialization 


It is with regard to the socialization dimen- 
sion that a major measurement breakthrough 
in the study of moral conduct first occurred. 
The Socialization scale of the California Psy- 
chological Inventory (Gough, 1969; Gough 
& Peterson, 1952), an empirically keyed mea- 
sure developed by comparing the responses of 
a large number of delinquents and nondelin- 
quents, was specifically designed to assess the 
degree to which a person has internalized 

“th ‘rules, values, and conventions of his so- 
FAN As an example of its remarkable prop- 
erties, the measure was given in eight different 
languages in 10 countries to totals of 21,772 
nondelinquents and 5,052 delinquents. In 
every comparison the test differentiated sig- 
nificantly between delinquents and nonde- 
linquents; furthermore, in no instance was the 
mean of a delinquent sample in one country 
greater than the mean of a nondelinquent 
sample in any other country (cf. Gough, 
1965). 'The scale seems to work equally well 
at various levels of socialization. At the upper 
level, Holland (1959) found, for example, 
that Socialization scores significantly pre- 
dicted academic achievement among National 
Merit Scholars. At the lower end, Vincent 
(1961) found that the scale differentiated be- 
tween female welfare recipients with one il- 
legitimate child and those with two or more, 
Thus, Gough’s Socialization scale seems to be 
a highly valid empirical index of the degree to 
which one has internalized culturally defined 
rules. 

Willingness to follow rules is an important 
determinant of moral behavior and a major 
component of character development. None- 
theless, two considerations Suggest that fur- 
ther information is necessary in order to as- 
Sess adequately a person's character structure. 


First, other things being equal, well-socialized 
Persons are also lik, 


t ely to be Stuffy, rule- 
bound, pedantic Prigs (e.g, “moral realists”). 
Second, many people who consistently comply 


with social norms did not enjoy the sort of 
childhood experiences that Produce a high 
degree of socialization. As Megargee, Parker 


and Levine (1971) demonstrate, within any 
Population of nondelinquents, itis always pos. 
Sible to identify a sizable 


subgroup charac. 
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terized by low Socialization scores. Phrasing 
the question in terms of our original paradigm 
case, how can we explain the student's re- 
fusal to cheat if we discover that he is in fact 
poorly socialized? 


EMPATHY 
Explanatory Role of Empathy 
, in the 

For at least 300 years philosophers 1 f 
Anglo-American (or utilitarian) tradition M 
ethics have assumed that man has an pec 
social sensitivity which plays an паро ү 
role in moral development. John Stuart 2 ly 
suggested, for example, that man is пареа 
sympathetic and that а sense of duty hc 
on the development of these social fee id 
Kurt Baier, a modern utilitarian philosoP are 
has proposed that certain social ped of 
justified when seen from the “moral йде r 
view,” a perspective which tends to atur 
the common good. This viewpoint is à dren? 
attitude which may be taught to e 0 
"When we teach children the moral pet 
view, we try to explain it to them by person" 
them to put themselves in another I є done 
place: How would you like to have tha when 
to you [Baier, 1965, p. 107]!" Thusi 7 fy 
a person acts from the moral | ac- 
tries to consider the implications of 0510100 
tions for the welfare of others. The per 16 
to take the moral point of view is C108% ° 
lated to empathy or role taking. ding J- М. 

Several social psychologists, inclu "Mead; 
Baldwin, W. McDougall, and б. ^i. 
have also assumed that people hee Y 
nate capacity for empathic sensitiv! nce. 
is normally elicited by social er d? a p 
these role theorists, group partcps, 0 
an increased sensitivity to the de? hich, i 
others results in self-consciousness v pules 9 
turn, causes the child to observe kii this | 
his society. The implications of а ei tha 
the explanation of social complian¢ ig И 
the test-taking student in our pepe js se? 
may refrain from cheating because rning iea 
tive to his peers’ expectations conce е see” d 
behavior; cheating might equally stude” 
taking unfair advantage of the other ° 


Measurement of Empathy item, a 
Hogan (1969) developed a ee 1 
Dirically keyed empathy scale БУ 
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the responses of 211 subjects, divided into 
high- and low-subgroups on the basis of em- 
iO иш, across 957 items taken from 
E Alora Psychological Inventory, the 
MMPI, and a special pool of additional items. 
The best item on the scale in the original item 
analysis was: “As a rule I have little difficulty 
in ‘putting myself into other people's shoes.’ ” 
The scale correlated .00 with Socialization, in- 
dicating that the two dimensions are indeed 
independent. It also correlated .58 with rated 
social acuity, .53 with overall skill at playing 
charades, and .43 with rated ethical sensi- 
tivity. 

If socialization and empathy are indepen- 
dent aspects of moral development, then in- 
dividual differences along these two dimen- 
should be associated with differences in 

aracter structure and moral conduct (see 
Table 1), Specifically, other things being 
RUM, persons scoring low on both scales tend 
i be delinquent. Those receiving low scores 
or empathy but high scores for socialization 
E. to be rigid rule followers—Piaget’s moral 
tior m Persons with low scores for socializa- 
BÉ E high scores for empathy tend to be 
ET about the conventional rules of 50- 
ра they are “emancipated,” mildly socio- 
sis E of normal society, that is, per- 
ей ie double-park in parking lots, do not 
m orrowed books, and smoke marijuana 
im. 5 refers to them as “chic types.” Per- 
ES with high scores for socialization and 
P» Pathy tend to be morally mature; their 

"ibliance with social rules is effortless but 
p. attitudes are tempered by a sympathy 

the moral frailities of others. 
тч empirically as well as conceptually use- 
аа assess social conduct in terms of both 
Pepsi and empathy. For example, the 
a ш for socialization and empathy for 
re bir e of 100 inmates of a New York State 
values y were 26.2 and 29.14 Comparable 
Were 36 ог a group of 100 Air Force officers 
Scoring 4 and 37.7. By considering all persons 
SH өң; below 36.5 on socialization and 39.0 

™pathy as delinquent, 95% of the de- 
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TABLE 1 
SUGGESTED CHARACTEROLOGICAL IMPLICATIONS OF THE 
INTERACTION BETWEEN SOCIALIZATION 
AND EMPATHY 


Socialization 


Empathy 
Low | High 


High “Le chic type" Morally mature 
Low | Delinquent “Moral realist” 


linquents were accurately classified, while 
27% of the nondelinquents were misclassified. 
Ina sample of 594 male undergraduates, Kur- 
tines and Hogan (1972) identified 130 stu- 
dents whose socialization scores were less than 
or equal to the mean socialization score of 
119 incarcerated delinquents (X = 26.88). As 
expected, the unsocialized students were rela- 
tively empathic compared to the delinquents 
(rus = 44). In a study of undergraduate 
marijuana use (Hogan, Mankin, Conway, & 
Fox, 1970), students were placed in three 
groups: users, nonusers, and principled non- 
users( e.g. students who said that they had 
not smoked marijuana and never would). Rel- 
ative to men in general, users received low 
scores for socialization and high scores for 
empathy, while the pattern was precisely re- 
versed for the principled nonusers. Thus there 
is some evidence for the characterological im- 
plications suggested by Table 1. 


Developmental Antecedents of Empathy 


In spite of the importance of empathy or 
role-taking ability as an explanation for moral 
conduct, little is known about the antecedents 
of this disposition. However, it seems rea- 
sonable to assume that the following four 
factors are related to the development of 
empathy. First, Mead (1934) thought role- 
taking ability was the “g” factor in intelli- 
gence. Role taking requires that a person 
adopt alternative perspectives with regard to 
various issues; the process of shifting perspec- 
tives seems similar to spatial reasoning (an 
important component of intelligence) ; and the 
empathy scale correlates between .30 and .50 
with several measures of intellectual per- 


formance. Thus there seems to be some as- 
sociation between intelligence and empathy. 
Second, being required to adopt alternative 
perspectives vis-d-vis one's parents should 
facilitate the development of role-taking skills. 
Consequently, parents who either overindulge 
or consistently reject their children probably 
fail to stimulate their children’s natural em- 
athic tendencies. Third, intelligence and 
E at role taking are necessary but in- 
‘sufficient conditions to produce empathic be- 
ha the child must also be willing to act 
hohe а perceptions. 

Such a willingness probably depends on 
two kinds of experience: (а) receiving em- 
pathic treatment at some time in one's life 
and (5) enduring sufficient injustice, ridicule, 
betrayal, or persecution that the experience 
makes a lasting impression. The fourth factor 
that seems to contribute to an empathic dis- 
position is a relative absence of repression or 
denial—an openness to inner experience, a 
willingness to attend to intuitive promptings 
and nonverbal cues, The development of em- 
pathy, like cognitive development generally, 
is probably finished by late adolescence and 
results in what may be called an “ethic of 
reciprocity,” characterized by cooperation and 
peer group conformity. 

In spite of the obvious importance of 
empathic disposition 
moral character, it is n 
most important factor 


an 
in the formation of 
either the only nor the 
in the process, There 


ty high scores on the 
ggest that they often suffer 
f role taking: They are too 
the expectations of others, 


, 
S re- 
he is 
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ErHics or CONSCIENCE AND THE ETHICS 
OF RESPONSIBILITY 


Explanatory Role oj the Ethics of Conscience 
and the Ethics of Responsibility 


There is a branch of moral philosophy ку 
analyzes and evaluates arguments used ans 
justify social and legal institutions. 1 pos 
siders questions of the following type: ly 
what kinds of considerations is appeal propery 
made when the authority of the political ee 
legal order and our obligations to comply a 
its dictates are called into question jer nt 
1961]?" In the history of modern lega it 
losophy two forms of justification cont a 
recur. Advocates of the first point lated to 
argue that there are higher laws, unrela vere 
human legislation, which may be шо just 
by intuition and reason; a human law m de 
if, and only if, it corresponds to or € o take 
rived from the higher laws. Those кг 
the second viewpoint deny the Le 
“higher” laws. Instead, they дап value 
arguments in terms of the instrument? — 
of the manifest law as a means for Pi 
the general welfare of society. Laws а! for 
cal institutions are merely instruments { Јам 
realization of the common good; JUS 
are those which tend on the whole (0 
mize happiness. 

With certain exceptions, the gu x 
has traditionally had the greater in^ 


There can be little doubt that id 
justification which has exerted the wides 
influence and most decisively EON 
development of social philosophy iS pe said of 
natural [ie., higher] law. It may eve? enterP us 
with the idea of natural law the whole ountin# E 
calling existing institutions to moral acc 

underway [Olafson, 1961, p. 81. 


theory 


з neret 

Utilitarianism and legal positivism 9^. inst 
in the eighteenth century as à FE E ад P 
the theory of higher law. The u 00 р 
losophers, primary architects of had y 
reform in the nineteenth century, t 4 ar i 
the concept of higher law was эн an К" 
standard for appraising institute jg jed 
it encouraged an anarchical indiv! " set! 
consistent with the rule of law ? 
Society. 

The intellectual and psychol 3 
underlying the higher law and P?" 


tue 


ү 
А 1 p? aw 
ogica itive 1 
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traditions in moral philosophy have numerous 
counterparts in everyday life and represent a 
relatively unexplored dimension of character 
development. The everyday equivalent of 
higher law morality can be called the ethics of 
personal conscience, while the ethics of social 
responsibility reflects the positive law tradi- 
tion. The ethics of conscience is the dominant 
Viewpoint of the contemporary American in- 
tellectual establishment; it is represented, for 
example, by many contributors to the New 
York Review of Books. On the other hand, 
Supreme Court Justice Frankfurter, Erwin 
Griswold, former dean of Harvard Law School 
and Solicitor General of the United States, 
and Н. L. A, Hart, the distinguished English 
legal and moral philosopher represent promi- 
nent recent spokesmen for the ethics of re- 
Sponsibility, 

Consider once again our hypothetical, 
DOncheating student whom we have discov- 
fred is neither well-socialized nor empathic. 

he advocates the ethics of responsibility, 
» тау then comply with the rules because 

léy make his world more orderly and pre- 
ооа That is, а belief in the instrumental 
blian, of rules should produce social com- 
Ru Despite deficient socialization and in- 
inc d to the expectations of others. Con- 
Social? persons who are nonempathic, un- 
will ized, and endorse the ethics of conscience 

Very likely be delinquent. 


Measurement of the Ethics of Conscience and 
€ Ethics of Responsibility 
Hogan (1970) developed a measure of the 
vi tion to adopt one ог the other of these 
pun nts. There are two forms of the test, 
with. the Survey of Ethical Attitudes, each 
Tepr, 35 items, The following statements are 
jug d E (а) All civil laws should be 
and w against a higher moral law; (0) right 
tong can be meaningfully defined only 
obeyed law; (c) an unjust law should not be 
Would 5 (d) without law the life of man 
€ nasty, brutish, and short. 
of S items are answered by checking one 
Strong] Tesponse options ranging from agree 
n _to disagree strongly. The parallel 
Colle reliability of the test in a sample of 98 
men was 88. The measure is uncor- 


relat 
*d with intelligence, and in two separate 
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samples it discriminated very strongly be- 
tween persons whose vocational choice re- 
flected a belief in law and established pro- 
cedures (i.e., policemen, ROTC seniors) and 
persons who believed in civil disobedience as 
a means for promoting social change (cf. 
Hogan, 1970). 

In contrast with Kohlberg (1963) and 
Piaget (1964), the author assumes that moral 
conduct is fundamentally “irrational,” that 
differences in even such obviously cognitive 
phenomena as moral judgments derive from 
more basic personological structures. If so, then 
there should be clear-cut personality correlates 
of the disposition to adopt the ethics of con- 
science or responsibility. Correlations with 
the California Psychological Inventory, the 
Myers-Briggs Type Indicator (Myers, 1962), 
and several other personality measures sug- 
gest that persons who adopt the ethics of 
personal conscience possess an interesting set 
of positive and negative characteristics. They 
are independent, innovative, and form creat- 
ing; however, they also tend to be impulsive, 
opportunistic, and irresponsible. From peer 
descriptions the adjective with the highest 
correlation (.49) with the ethics of conscience 
was “rebellious.” Persons whose moral judg- 
ments reflect the ethics of social responsibility 
also show a blend of attractive and disagree- 
able features. On the one hand, they are rea- 
sonable, helpful, and dependable; on the other 
hand, they often seem conventional and re- 
sistant to change. The single adjective from 
peer descriptions most characteristic (r= 
47) of the ethics of responsibility was 
“thoughtful.” 


Developmental Antecedents of the Ethics of 
Conscience and the Ethics of Responsibility 

The next question concerns the develop- 
mental antecedents of the ethics of conscience 
and responsibility. A study by Hogan and 
Dickstein (1972b) provides some insight into 
this problem. Interviews with high and low 
scorers on the Survey of Ethical Attitudes 
suggested that both groups were highly sensi- 
tive to injustice, but differed markedly with 
regard to their perceptions of the sources of 
injustice in society. The ethics of conscience 
seemed related to the belief that people are 
naturally benevolent and that social injustice 
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is produced by dehumanizing and oppressive 
institutions; the ethics of responsibility 
seemed to rest on the notion that people are 
naturally malevolent and that institutions re- 
strain the implacably antisocial impulses of 
men. Scores for the tendency to blame people 
or institutions were assigned on the basis of 
a projective test (Hogan & Dickstein, 1972a). 
The average correlation between these scores 
and the Survey of Ethical Attitudes across 
three groups (total N — 92) was .58 (.77 
when corrected for attenuation), suggesting 
that the ethics of conscience and responsibil- 
ity are indeed related to a person's perceived 
locus of injustice. Thus home environments 
which encourage a belief in natural human 
goodness and a suspicion of institutions tend 
to promote the ethics of conscience. Con- 
versely, parents who foster skepticism concern- 
ing the motives of others encourage a belief 
in the instrumental value of law and, conse- 
quently the ethics of responsibility (see also 
MacDonald, 1971). Finally, while the moral 
and developmental implications of the ethics 
of conscience and responsibility can be prop- 
erly understood only within the context of a 
person's total character structure, the most 


mature people should tend to cluster in the 
center of this dimension. 


AUTONOMY 
Explanatory Role oj Autonomy 


Two considerations suggest that the dimen- 
sions of moral development discussed this far 
(i.e., moral knowledge, socialization, empathy, 
and the  ethics-of-conscience—ethics-of-re- 
sponsibility continuum) are, by themselves, 
incomplete either as explanations of moral 
conduct or descriptions of character structure. 
First, from a philosophical perspective, these 
dimensions reflect a distinct utilitarian bias. 
However, Kant argued that compliance with 
social norms based on self-interest (or the 
larger welfare of society) is in no sense moral. 
Rather, the truly moral man has an au- 
tonomous will and governs his actions by a 
personal sense of dut 


s у. Reflecting the Kantian 
ethic, Nietzsche described utilitarianism as а 


morality fit only for English shopkeepers. 
us, from the viewpoint of traditional ethics 
a complete description of moral development 
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must include the factor of autonomy. The dis- 
cussion of character development presented 
thus far also appears inadequate when i 
from the perspective of common sense. TM 
is, а well-socialized and empathic adherent E 
either the ethics of conscience or resp 
bility is socially conforming, and is a mo ds 
citizen when judged in terms of the standare 
of his sociocultural milieu. However, v 
and legal institutions are never totally Mi 
and political leadership is often misguided née 
not actually corrupt). Consequently, it Ке 
times happens that а model citizen тау cal 
immorally as а result of complying with аш 
lective norms, The development of a? im 
tonomous set of moral standards serves bi of 
sulate one from the potential immorality 
the community. non- 
Returning for the final time to OUT 40 
cheating student, he may refuse to copy я В 
because he is socialized, not because he E n 
cerned about the evaluations of his amo 
because he feels rules are useful, but ү. dig 
he considers cheating to be beneath Ms 
nity as a person. t А? 
Autonomy is a troublesome conch ized 
autonomous person who is also uns0c „ma 
nonempathic, and disregards the pre crate 
value of rules as well, is likely to be a sc 
and anticonforming, a great rogue Е 
drel. Obviously this dimension, lik 


e 
d 

а 2 ч olatio 

others, can never be considered in !5 


Measurement of Autonomy 


Аз a quantitative index en 
author uses a measure of indepe" 
judgment developed by Barron (1 att 
22-item scale was constructed by viel 
the responses of yielders and попу. 
Asch’s original (1956) conformity -y 
The measure has provided some D^ ci 
sults; however, the items are quie mare y 
and the resulting scores have only of auto". 
reliability. An alternative measure rove E 
omy with simpler items and p anm Me 
liability would be a useful measure roce 
tribution, and work is currently P 
on such a measure. i 

Although social psychologists 
a great deal of time studying SUS", 
and conformity, not much is m Я 
autonomy. In an unpublished St 


ibil it 
abo" 


К" 


have Fiy 
g est 


©“ ey. 
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tines? gave the California Psychological In- 
ventory to 30 fraternity members who were 
undergraduates at Johns Hopkins University. 
Eleven of these persons rated all the others 
for autonomy using a 7-point scale. Each per- 
son was assigned an autonomy score based 
on the average of the 11 ratings he received; 
the average interrater correlation was 41, 
апа the reliability of the composite autonomy 
ratings was .89, Eight of the 18 California 
Psychological Inventory scales correlated 
above .39 (p< .05) with these ratings; they 
Were: Dominance (.56); Capacity for Status 
(.60); Social Presence (.42); Self-Acceptance 
(.52); Good Impression (.47); Communality 
(.39); Achievement via Conformance (.40); 
and Intellectual Efficiency (.57). To estimate 
ве pattern of personological variables most 
2 ated to autonomy, a stepwise regression 
Dalysis was conducted, selecting five vari- 
ables, The resulting equation, which had a 
multiple ғ of .76, included Dominance, Ca- 
cones for Status, and Self-Control with posi- 
ма weights, Sociability and Femininity with 
: ative weights. Although the size of this 
‘ample permits only tentative conclusions, the 
ne of zero-order correlations with the 
HE e Psychological Inventory and the 
СВ, equation suggests that the auton- 
cend S person is strong, forceful, and self-as- 
and ant; he manages his affairs very carefully, 
Ht Is little affected by others in choosing or 
ER eving his goals. Such a person resembles 
C Old Testament patriarch ог Melville’s 
aptain Ahab. 


D 
evelopmental Antecedents of Autonomy 


n The Moral Judgment oj the Child, 
ity Set (1964) suggested that children’s moral- 
15 initially “heteronomous,” authoritarian, 
adult onditioned upon a unilateral respect for 
child authority, If left alone by adults, the 
of жыз ышы through the natural processes 
Snitive development and peer interaction 
Бад ai and democratic morality 
Point os mutual respect among peers. On this 
om laget seems surely to have been wrong. 
Daring child-rearing practices 1P the 


partment 


EW, 
of p` Kurtines, A study of autonomy. De te 
› 


Ps d 
197; Chology, Johns Hopkins University, 
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Soviet Union and the United States, Brofen- 
brenner (1970) observes, for example, that 
exclusive peer interaction in American chil- 
dren has produced effects precisely the op- 
posite of those Piaget predicted; while Ameri- 
can children, relative to Russian children, be- 
come emancipated from adult authority 
earlier, they remain thereafter excessively de- 
pendent on peer approval (see also Boehm, 
1957). 

In a carefully documented monograph, 
Baumrind (1971) suggests some important de- 
velopmental precursors of autonomy. First, 
parents who are permissive, undemanding, 
passively accepting, or overprotective inhibit 
the development of autonomy in their chil- 
dren. Second, early development of cognitive 
and linguistic skills and stimulation of in- 
terest in school achievement seem to enhance 
a child's competence, and therefore, the self- 
esteem necessary for autonomous behavior. 
Third, parents who are themselves individual- 
istic, independent, and demanding provide 
their children with the strongest models for 
autonomous conduct. Finally, parents who 
clearly label certain actions as praiseworthy 
or blameworthy, explain their rules and en- 
courage verbal give and take, also promote 
the child's ability to “order” and control his 
own behavior. Although Baumrind has clari- 
fied the developmental antecedents of auton- 
omy, this disposition probably does not fully 
emerge until а person leaves his peer group 
and begins his life (defined in terms of mar- 
riage and career). Then, in combination with 
socialization and empathy, autonomy may 
produce an “ethics of equity "—in psychologi- 
cal terms, moral maturity. 


COMPLETE MODEL 


Five important aspects of character de- 
velopment have been described which, it has 
been suggested, are related to five considera- 
tions required for the explanation of moral 
behavior. Each dimension is defined in terms 
of a specific assessment device. While the 
dimensions are conceptually independent, the 
relationships among the scales used to define 
them can be determined empirically. Table 2 
is based on data taken from a variety of 


sources, and represents the best current 
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TABLE 2 


ESTIMATED RELATIONSHIPS AMONG FIVE DIMENSIONS 
OF CHARACTER STRUCTURE 


Correlations 


Au- 
ton- 


| 
Dimensions | | 

| So | Em | sea 

| | оту 


.00 | .30 .00 30 


Moral knowledge 
00 -26 09 


Socialization (So) 
Empathy (Em) T" 
Ethics of responsibility (SEA) 


Note. SEA = Survey of Ethical Attitudes, 


estimate of the magnitude of these relation- 
ships. The correlations with moral knowledge 
for socialization, empathy, and the Survey of 
Ethical Attitudes are based on associations 
with the Scholastic Aptitude Test. The cor- 
relation between autonomy and moral knowl- 
edge is an estimate derived from Crutchfield’s 
(Krech, Crutchfield, & Ballachey, 1962, p. 
526) report of a dependable relationship be- 
tween IQ and nonyielding in his experiments. 
The other correlations summarize data from 
the author’s files. No claims are made for the 
generality of these coefficients, and other in- 
vestigators may want to determine their size 
for themselves. 

These dimensions also work moderately well 
together. Hogan and Dickstein (1972a) gave 
three fraternities (W = 92) а 15-item sen- 
tence completion test on which subjects were 
asked to respond quickly and briefly to such 
statements as: (a) I think it is unnecessarily 
cruel to keep condemned prisoners on death 
row for so long, and to make the execution 
such an elaborate ritual. (5) The police should 
be encouraged in their efforts to apprehend 
and prosecute homosexuals. Homosexuality 
threatens the foundations of our society. 

Responses to these items were scored for 
overall maturity of moral judgment using the 
following scoring elements: (a) concern for 
the sanctity of the individual; (Б) judgments 
based on the spirit rather than the letter of 
the law; (c) concern for the welfare of so- 
ciety as a whole; (d) capacity to see both 
sides of an issue. 

Four raters scored each protocol; 2 points 
Were assigned to an item if any of the four 
Scoring elements was clearly present in the 


ROBERT HOGAN 


response, 1 point if any of the scoring ele- 
ments could be readily inferred, O if they 
were missing. The interrater reliability was 
estimated to be .88, and the reliability of the 
test itself was .82. The validity of this pro- 
jective measure was estimated as follows: 
Each member of the fraternity rated the 
others for “sensitivity to injustice," defined as 


. one who is quick to perceive unfairness in the 
decisions of persons or groups, or in the treatment 
that persons or groups receive from others. He may 
or may not openly express his concern; however his 
feelings will be obvious to those who know him. 
One who is insensitive to injustice will be less likely 
to notice the unfairness in a situation, and will 
rarely show concern when such unfair treatment is 
pointed out to him. 


The reliability of these ratings after refine- 
ment by factor analysis varied between .86 
and .93. The correlations between scores for 
maturity of moral judgment and rated sensi- 
tivity to injustice were .43, .13, and .52 for 
the three fraternities, respectively. 

If scores for maturity of moral judgment 
are taken as an index of “moral maturity,” 
then correlations between these scores and the 
five dimensions of character development pro- 
vide some indication of how well the total 
model works. These data are presented in 
Table 3, which contains no information about 
the relationship between mature moral judg- 
ment and moral knowledge. Kohlberg (1964) 
reports, however, that for a population of 72 
adolescent boys the correlation between IQ 
and maturity of moral judgment (with age 
controlled) was .31. With the exception of 
the Survey of Ethical Attitudes, the correla- 
tions in Table 3 support the model well; for 
example, persons who make mature moral 


TABLE 3 
CORRELATIONS BETWEEN Four Dimensions OF CHAR- 
ACTURE STRUCTURE AND Maturity OF 

MORAL JUDGMENT 


Mature moral 


Dimension judgment 
Socialization r1 
Empathy 8 40 
Ethics of responsibility (SEA) PS 6 
Autonomy 5 


Note. N = 41. SEA = Survey of Ethical Attitudes. 
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Judgments tend to be well socialized, em- 
PE and autonomous. The negative cor- 
E. E W ith the ethics of responsibility, how- 
the сон interpretation. High scores on 
Es sic Ethical Attitudes reflect. the 
the ls iii low scores indicate 
m ilies conscience. 1 he mean score on 
MN N ps of Ethical Attitudes for this sam- 
ings oe ^s, and the range was 4 25. Rat- 
Dim 5 ks e moral judgment varied be- 
Bivariate =. 14.0. An examination of the 
бае Gee reveals that the two persons 
Ше iuo he est (ethics of responsibility) and 
TN s hei lowest (ethics of conscience) 
E ios н the survey together received an 
Гаек maturity rating of 40. On the 
ри e. the person with the highest moral 
of 18 A ms in the sample received a score 
generally the survey. Thus, the scatter plot 
the iia the prediction derived from 
ethics.of. that persons in the middle of the 
mension conscience-ethics-of-responsibility di- 
either e carm more mature than the persons at 
Xtreme. 


CHARACTEROLOGICAL IMPLICATIONS 


Eos Шш dimensions of character develop- 
acter a s inexorably toa discussion of char- 
number Des. Psychologists have proposed a 
Varying of character typologies which, while 
саге considerably in detail, also have many 
Eon |" ommon (сї. Kay, 1968; Wright, 
Velo pi, n The Psychology of Character De- 
Senteq =, Peck and Havighurst (1960) pre- 
ese: what is perhaps the most important of 
зан might be useful to trace the parallels 
cter is their typology and the model of char- 
COM pment presented here. To define 
er devel y the primary dimensions of charac- 
uel d» Peck and Havighurst (1960) 
5 « in situ ratings of adolescent subjects 
reorref a ү traits. These traits were 1n- 
actors wee factor analyzed, and the three 
de ich emerged should be familiar. The 
Scribes 


liy, Pehavior which conforms positively to the estab- 
Vecto; Moral code , , , a person who exemplified this 
Conf 1S one who is conventionally moral, socially 
®proveq S to both the maternal and peer codes o! 
Чопа "ed behavior, and shows no hostility or emo- 
that t е comfort in doing so. .-- We might infer 

Person high on this characteristic conforms, 


not so much from pressures to do so, but because 
he enjoys it and finds it the most comfortable and 
satisfying way to live [pp. 244-2451. 

This dimension is clearly socialization. 

Peck and Havighurst’s (1960) second di- 
mension was defined by (a) the “degree to 
which subject accurately recognizes the usual 
behavior patterns of the people around him, 
can predict what they are likely to do, and 
how they are likely to react in a given ac- 
tion [p. 221]”; (5) the “capacity to under- 
stand other people’s reasons for acting as they 
do [p. 221];" (c) the “capacity for ‘feeling 
with’ other people; for experiencing, at least 
to some extent, the same emotions they are ex- 
periencing at the moment [p. 221];" and 
(d) the “degree to which conscious or precon- 
scious self-perception is congruent with the 
behavioral self [p. 223].” This second dimen- 
sion closely approximates empathy. 

The third dimension was more ambiguous. 
It was defined as *. . . the degree to which 
behavior is directed by, or is in accord with 
a present and functioning superego [p. 236]." 
This depends on 


. . . the degree to which superego directives are inte- 
grated into behavior, ie, a person with only selí- 
punitive superego “voices” which do not forestall 
immoral behavior does not have an integrated system 
of principles . . - guilt per se is not a direct measure 
of superego strength . . < [Peck & Havighurst, 1960, 
p. 236]. 

The third dimension resembles what we have 
called autonomy. 

Peck and Havighurst next define five char- 
acter types, labeled amoral, expedient, con- 
forming, irrational-conscientious, and rational- 
altruistic, The first represents a person who 
lacks any self-imposed control or concern for 
the moral requirements of social living. How- 
ever, the “. . . last four represent the four 
kinds of reasons why a person may behave 
according to the moral standards of his so- 
ciety [р. 4].” And the development of these 
character types is explained in terms of the 
dimensions we have called socialization, em- 
and autonomy (cf. Peck & Havighurst, 
1960, p. 87). That is, expedient types were 
intelligent enough to understand the utility 
of conformance; however, they were unso- 
cialized, nonempathic, and nonautonomous. 
Conforming subjects were well socialized but 


pathy, 
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neither empathic nor autonomous. Those clas- 
sified as irrational-conscientious were well 
socialized, autonomous, and nonempathic. 
Only the few rational-altruistic persons were 
socialized, empathic, and autonomous. 

Let us now review the complete model by 
considering in what sense it can be called de- 
velopmental. Two themes define the develop- 
mental perspective. The first is that there are 

efie endpoints to development, endpoints 
Which are preceded by qualitative changes 
ove! e. According to the model presented 
here, the ideal endpoint of moral development 
is moral maturity, defined in terms of op- 
timal placement on the five dimensions; the 
practical endpoint is social conformance, 
which can be explained in terms of these di- 
mensions. (The relationship between the two 
endpoints is seen in the fact that while the 
morally mature typically conform to rules of 
conduct, all those who conform are not neces- 
sarily mature.) 

Moreover, the emergence of socialization, 
empathy, and autonomy represent transition 
points which are followed by qualitative 
changes in the dynamics of social behavior. 
That is, until a child becomes socialized, he 
is egocentric, impulsive, and undisciplined. 
After socialization but before empathy de- 
velops, he is excessively respectful of adult 
authority. An empathic but nonautonomous 
person places a greater priority on human 
needs than on the maintenance of rules; how- 
ever his conduct remains closely tied to the 
expectations of his peer group. It is only after 
a degree of autonomy has been achieved that 
behavior may become independent of external 
controls, In contrast with these three dimen- 
sions, moral knowledge and the ethics-of- 
conscience—ethies-of responsibility continuum 
are not developmental in the sense described 
here. 

The second theme defining the developmen- 
tal perspective is that life is a continuous 
process of adjusting internal conditions to 
external demands; these adjustments are made 
necessary by changes which occur in both the 
organism and the environment. The present 
model also incorporates this characteristic of 
developmental theory. The major problem for 
an infant is securing adequate nurturance and 
Protection from his caretakers, Moreover, as 
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the child becomes mobile, he tends to wander 
further from his mother’s protection, thereby 
exposing himself increasingly to danger. Thus 
obedience to maternal commands both fa- 
cilitates mother-child interaction and im- 
proves the child's chances for survival. The 
primary developmental task for a child in 
the middle years is to become a social being, à 
process brought about almost exclusively 
through peer interaction. For the typical child 
such interaction begins around 4 or 3 years 
of age, where because adaptation to the group 
requires accommodation to a radically ex- 
panded set of norms, continued strict obedi- 
ence to parental commands is maladaptive. 
The development of empathy greatly improves 
the capacity to anticipate and the willingness 
to comply with the expectations of the group— 
the prerequisite for sociability—and increases 
flexibility in peer interaction. Finally, the 
most important problem for the young adult 
is to establish his own life. Yet most young 
people are caught between the internalized de- 
mands of their family and the external ex- 
pectations of the peer community. Thus the 
development of autonomy is an important 
solution to the functional dialectic of develop- 
ment; it allows one to deviate from both par- 
ents and peers, but in a (hopefully) prosocial 
fashion. Seen in this way, the emergence О 
socialization, empathy, and autonomy аге 
adaptive within the context of the social en- 
vironment during maturation; they are Ca 
pacities which mediate both the needs of ш 
individual and the demands of his socia 
group. " 
The present model differs from conventio! 
developmental theories in one important n 
spect. Most such theories describe develop 
ment as proceeding, through preset stages i 
à step-wise fashion, and successive levels A 
be achieved only after the preceding stage" 
have been mastered. This paper suggests ue 
socialization, empathy, and autonomy ? 
major transition points in moral developmen 
which occur at progressively later points ba 
time. Moreover, once attained, these capaci? 
bring about qualitative changes in the un ich 
lying structure of moral conduct [ub 
phenotypically may remain unchanged). n 
this model, however, attainment of the € 
“stages” is not dependent on successful tran 


nal 


lion through the earlier levels. Rather, all 
three stages are distinct developmental chal- 
lenges whose outcome defines each person’s 
unique character structure. 

Because contemporary social theorists often 
Seem unconcerned with the problem of con- 
formity, one last comment on this topic is 
appropriate. The model presented here de- 
fines the formal parameters of moral maturity. 
However, these qualities may rarely exist in 
а single person. If so, then the obvious con- 
clusion is that most people are less than 
rues mature. Furthermore, the proportion 

ugs and psychopaths in society seems in- 
dependent of their place in the social order; 
te class determines the form rather than 
he frequency of delinquency conduct. In ad- 
dition, evidence from history suggests that to 
expect people to be compassionate, charitable, 
and tolerant is patently unrealistic. As Camus 
(1955) observed, the “Be virtuous” of our 
eonfessionals reveals “а nostalgia at the same 
gs as an ignorance;” it is a sterile exercise 

a great subject. Paralleling Camus, Freud 
ed maintained in his later writings 
{ы} пе most we can reasonably expect is 

people will comply with the rules, that 
Sten ш avoid doing evil, that they will not 
Bs property, molest our families, or 
us. Consequently, since little beyond 
cane possible, the practical endpoint of 
development, social conformance, must 
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ELECTRODERMAL ACTIVITY AS AN INDEX OF 
AROUSAL IN SCHIZOPHRENICS* 


RICHARD A. DEPUE 2 


Southern Ilinois University at Carbondale 


Previous reviews of arch utilizing el 
arousal in schizophrenia have found no 
phrenics and normals, although other psy 
chronic schizophrenics to be more arouscd 
dermal research, however, 
for two aspects of electrodermal activ 
during periods of minimal stimulation, 
sity and (b) frequency of spontaneous 
stimulus conditions. Both of these mea 
vield results indicating that chronic scl 
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nus the earlier writings of Duffy (1951), 
(1 or de Hebb (1955), and Malmo 
tion) р he construct of arousal (or activa- 
explanate gained enormous impetus as an 
and aaepe f concept in theories of behavior 
and th provided a framework for research 
(Lace Sory in the area of psychophysiology 
tion E rii Wilder, 1950). The culmina- 
theory А work has resulted in а generalized 
Proven | arousal, which more recently has 
researcher bea powerful explanatory tool for 
Ogica] deh attempting to account for psycho- 
1968. fon in schizophrenia (e.g., Broen, 
1966) Hs & Buss, 1965; Venables, 1964, 
ized ar owever, a major criticism of general- 
arousa] Ousal theory is that the indices of 
nomic (electrocortical, neuromuscular, auto- 
intere; Show low, sometimes insignificant 
пау; * elations, especially when taken across 

duals (Lacey, 1967). Thus an inference 


Neer: 
n I arousal must be based on more 
et response system. 
Schizo ature on the psychophysiology of 
lem ia enia presents much the same prob- 
Ttaining to measurement of arousal 
ieee 
he 
ported in eParation of this manuscript Was sup- 
ТЕ-07035 {г by Biomedical Sciences Support Grant 
E on from the National Institutes of Health. 
pal Sue Was made available in the form of a 
Fane for health-related research awarded by 
a ae College of the University of Iowa. 
Critica] s wish to thank Andrew L. Dickson for 
^" reading of the manuscript. ; 
5 for reprints should be sent to Richard 
nive 'epartment of Psychology, Southern Ti- 
TSity, Carbondale, Illinois 62901. 


1 


2 


233 


reveals that consi 
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lectrodermal activity as an index of 
consistent. differences between schizo- 
chophysiclogical measures have shown 
]. A more detailed look at the electro- 

istent results have been reported 
(а) habituation of responses or levels 


such as tones and lights of low inten- 


electrodermal responses regardless of 
isures appear to reflect arousal and 
hrenics are overaroused. 


hizop| 
level. Moreover, the data seem most divergent 
when comparing indices within the autonomic 
nervous system. For example, in a recent 
review, Lang and Buss (1965) concluded 
that only the cardiovascular system has con- 
sistently differentiated normals and schizo- 
phrenics in terms of arousal level (chronic 
schizophrenics showing autonomic nervous 
system overarousal) whereas electrodermal ac- 
tivity has produced inconsistent results. Gold- 
stein and Acker (1967) and Broen (1968) 
have come to similar conclusions concerning 
these autonomic nervous system indicators. 
In view of the present status of arousal the- 
ory, it would be most valuable to demonstrate 
that chronic schizophrenics are overaroused on 
an autonomic measure other than those which 
reflect cardiovascular functioning. 

Because the electrodermal data appeared to 
be inconsistent with the general trend of 
psychophysiological research in schizophrenia 
(i.e., overarousal in these subjects), a more 
microscopic examination of these results 
seemed in order. Our review of the literature 
rmed the previous conclusions that, in 
hizophrenics differ electrodermally 
als neither in terms of tonic levels 
(kei 


confi 
general, sc 


from norm 
with respect to phasic response 


nor 
galvanic skin response) amplitudes. Exclu- 
sion of studies including patients receiving 


medication did not alter these conclusions. 
Since these studies have already been re- 
viewed by others (Lang & Buss, 1965), they 
will not be discussed here. Two aspects of 


electrodermal activity did, however, ap- 
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pear to produce consistent results indicating 
greater arousal for schizophrenics, and those 
are reviewed below. 


Spontancous Responses 


In continuous recordings of electrodermal 
activity, nonspecific or spontaneous fluctua- 
tions which are uncorrelated in time with 
specific stimuli presented by the experimenter 
frequently occur. They are defined in the 
same manner as are phasic responses? except 
that they occur outside of the latency limits 
set for specific responses (usually 1—3 sec- 
onds after stimulus onset). There is con- 
siderable evidence that measures of frequency 
of responses are correlated with arousal 
level in normals (Burch & Greiner, 1960; 
Greiner & Burch, 1955; Silverman, Cohen, & 
Shmavonian, 1959; Surwillo & Quilter, 1965) 
and are independent of base-line measures 
(Lacey & Lacey , 1958), high frequencies 
being associated with increased arousal. Simi- 
larly, this measure has been used exten- 
sively as an index of arousal in psychiatric 
populations (Lader, 1971). 

Of the investigators using spontaneous- 
fluctuation measurement, all have reported 
greater frequencies for schizophrenics except 
those employing patients on phenothiazines 
(Bernstein, 1964, 1970; Fenz & Velner, 
1970). Thus, under conditions of minimal, 
nonsignal stimulation, Zahn (1964) and 
Zahn, Rosenthal, and Lawlor (1968) reported 
spontaneous fluctuation frequency in schizo- 
phrenics (both acute and chronic) to be three 
times that of normals, though this difference 
decreased during a simple reaction time task 
in another study. Moreover, in a ratio of 
Specific to nonspecific responses analogous to 
a signal-to-noise ratio, normals had a reliably 
higher ratio than schizophrenics, indicating 
that the greatest percentage of the latter 


group’s responses were to nonspecific stimula- 
tion. 


?Phasic responses are usually defined as either 
à response of at least 300 ohms (this varies 300-500 
ohms) deflection occurring. within 1-3 seconds fo]. 
lowing specific stimulus onset (e.g., Bernstein, 1970; 
Zahn, Rosenthal, & Lawlor, 1968), a change score 
from basal level (ea, Pishkin & Hershis 1963), 
or the largest responses in an individual’ record 
(eg, Goldstein & Acker, 1967). 
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Using similar stimulus conditions, Bernstein 
(1967) found both regressed and remitted pa- 
tients to produce a greater number of spon- 
taneous fluctuations than controls, while other 
researchers (Fowles, Watt, Maher, & Grin- 
spoon, 1970) reported significantly greater 
than normal spontaneous fluctuation activity 
in good and poor premorbid patients under 
resting conditions, with poors exhibiting the 
greatest frequency. А 

Investigations utilizing psychologically 
stressful conditions revealed similar results. 
Thus, Syz (1926) and Syz and Kinder 
(1928) reported 21% and 8% тоге аро 
taneous fluctuation activity in paranoid HL 
catatonic patients, respectively, than in we 
trols during periods of listening to emotiona y 
provoking words. No response to words wa 
required. Furthermore, Syz reported data 
similar to Zahn’s (1964; Zahn et al., 1968) 
mentioned earlier in that paranoids and wer 
tonics revealed fewer responses to epe 
stimuli (12% and 29% fewer, respectivei d 
Most striking here is the finding of s 
ity between paranoids and other schizophre 
subtypes in terms of spontaneous ugg 
frequency and signal-to-noise raito, since m e 
other measures reveal divergence for thes 
subjects. Similarly, Jurko, Jost, and e 
(1952), comparing early paranoids and na 
mals, observed greater frequency of güditin ы 
responses in patients during recovery |t 
ods following pictures depicting frustra"? 
circumstances. 

Finally, Stern, Surphlis, and Koff ` 
found that when tested periodically with "eck 
stimulation (70 decibels) over a v" 
period, schizophrenics on phenothiazines, Pi ; 
viously designated as having a good progno, v 
significantly decreased in spontaneous T 
ation frequency while poor prognostic. Ve 
tients maintained their initial frequen 
Though similar data were reported for van 
stimuli, the results were less dramatic 


1965) 
off ( one 


n 
A «Ати dic 
those for tones. Since the two groups d f: 


0 
s ” ; rms 
differ prior to drug treatment in te! 


at- 
spontaneous fluctuation frequency, pa 
tractive notion that decreasing spontane 

fluctuation activity over time is indicat aa 
improved health deserves further wp 
tion. Moreover, the efficacy of frequency b- 
sures in differentiating schizophrenia ^ 
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lypes when patients are on drug treatment 
15 most interesting and also deserves further 
research. Unfortunately, the lack of a 
Control group precluded comparisons with 
normal subjects. 

Da ЧИЕ feature of the above studies is 
ü nonstration that spontaneous fluctua- 
lon frequency. measures differentiate patients 
тни under a variety of stimulus 
aes and not just those of minimal 
E ed However, some evidence suggests 
fhe Aris ec requiring overt responses on 
i» on e M oe meia are less efficacious 
(1964) Min h 1 owles et al. (1970) and Zahn 
a Аа на less difference in spontane- 
bns M frequency between patients 
Bis I 7 s during reaction time tasks than 
et al ps under conditions of rest (Fowles 
Шон z 70) or minimal, nonsignal stimula- 

Zahn, 1964). 


Habituation 


or dabituation to an experimental situation 
two кы series of stimuli may be quantified in 
ays with electrodermal measurements. 
pe traditional way is by measuring 
{һе plitude or frequency (ie, probability) 

a feet TE response to a series of stimuli, 
response i being defined as a decrease in 
Conventie amplitude or probability. A less 
Of skin nal method is to use measurements 
eve] “ween level or skin potential 
Cline in h habituation being defined as a de- 
hing t these measurements from the begin- 
bus the end of a specified time period. 
in t ae in this second sense can be used 
ample absence of specific stimulation, for ex- 
vel ae periods, Since skin conductance 
absence. skin potential level decline in the 
Teabso of sweat gland responses because of 
rption of sweat in the duct (Fowles, 
deis extent of this type of habituation 
Deous g У related to the amount of spon- 
ls or " Weat gland activity during rest peri- 
3 юана the combined effect of specific and 
Stimulus Ous responses during а series of 
ou Presentations, The greater spontane- 
h Chine ion activity found for anho 
abit ка lead to a prediction of ітресес 
"or a lon ior these subjects relative to 
The 5. This has been found to be the сазе. 
Most extensive analysis of habituation 
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in schizophrenia was undertaken by Zahn 
(1964) and Zahn et al. (1968) using tones 
and lights of low intensity. While analysis oi 
basal levels in these studies revealed higher 
overall skin conductance level in chronic pa- 
tients, this result was attributable to a failure 
to show a decrease in skin conductance level 
across trials in the patient group relative to 
normals, and similar trends were found for 
spontaneous fluctuation frequency over time. 
The results were similar for both tone and 
light stimulation, though more pronounced 
with light. 

Turning to analysis of orienting responses 
to specific stimuli, Zahn et al. (1968) found 
similar results. In a subsample of patients 
matched with normals on mean and range of 
total orienting response frequency, schizo- 
phrenics showed invariant responding in terms 
of the number of specific orienting responses 
across trials, whereas normals steadily de- 
creased in number. Upon closer inspection of 
the data, it was discovered that while normal 
subjects were characterized by gradual dimi- 
nution of orienting response frequency, schizo- 
phrenics showed irregular stop-and-go re- 
sponding. In light of the lower specific/non- 
specific ratio found for patients, Zahn et al. 
(1968) suggested that this group's slower rate 
of habituation to specific stimuli was due in 
part to constant yet partial dishabituation re- 
sulting from attentional shifts to novel, non- 
specific stimuli. Though this process would not 
preclude complete habituation, it would never- 
theless impede its rate. 

One additional measure of orienting re- 
sponse habituation confirmed the above re- 
sults (Zahn, 1964; Zahn et al, 1968). With 
zero amplitude responses eliminated from 
analysis, both acute and chronic schizo- 
phrenics exhibited slower habituation of 
specific orienting response amplitude than 
controls across trials. A similar finding was 
reported for active schizophrenics in a study 
utilizing similar stimulus conditions (Ven- 
ables, 1960), although differences did not hold 
when background stimulation was presented 
nor for withdrawn patients. 

It is of interest that if the above analyses 
in Zahn et al.’s (1968) study had been com- 
puted on the first eight trials alone, the dif- 
ference in habituation rates would not have 
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been discovered. It was only after the first 
eight trials that normals began to decrease 
in response amplitude and frequency, whereas 
schizophrenics continued to respond in an 
invariant manner. Bernstein (1967) observed 
a similar divergence. While patients and con- 
trols showed equivalent responding initially, 
base-line and spontaneous fluctuation mea- 
sures decreased in normals during the last 
4 minutes of the 6-minute stimulation period, 
whereas both regressed and remitted schizo- 
phrenics revealed invariant basal reactivity 
and increasing spontaneous fluctuation fre- 
quency. Apparently in the Zahn and Bern- 
stein studies, controls were exhibiting normal 
orienting response habituation while schizo- 
phrenics were not. Along the same lines, Israel 
(1966) reported slower habituation of skin 
conductance level for schizophrenics during 
exposure to mild photic stimulation, but no 
difference in overall skin conductance levels. 
Furthermore, the divergence over trials was 
greatest for a subsample of patients charac- 
terized as poor assimilators of novel stimula- 
tion (i.e., poor habituators). Similarly, Fowles 
et al. (1970) noted slower habituation of 
skin potential level in poor premorbids rela- 
tive to goods and controls during an initial 
10-minute rest period, while none of the 
groups differed in initial skin potential level. 
Poors also showed invariant spontaneous 
fluctuation responding during a subsequent 
reaction time task relative to normals. 

Finally, Stern et al. (1965) again confirmed 
the value of using electrodermal activity as 
an indicator of improved health in patients. 
Upon admission, good and poor prognostic 
Schizophrenics failed to differ in terms of 
habituation of orienting response reactivity 
to tones and a word-association task. How- 
ever, 5 weeks later, poors were found to 
habituate less rapidly than goods, although 
the results were less pronounced with the 
word-association series. 

In summary, then, the invariant responding 
by schizophrenics across time appears to be 
effective in differentiating normal and schizo- 
phrenic groups. In addition, a few studies 
(Fowles et al., 1970; Stern et al., 1965) sug- 
gest that patients with poor prognoses habitu- 
ate less rapidly over time relative to good 
prognostic subjects. It appears that good and 


RICHARD A. DEPUE AND DON C. FOWLES 


poor premorbids may not differ in habitua- 
tion (and spontaneous fluctuation activity) 
upon admission, but that goods subse- 
quently exhibit relatively normal patterns of 
responsivity more quickly than poors. 

It is important to note that habituation 
effects are demonstrable with amplitude a5 
well as frequency measures of electrodermal 
activity, One reason for this is that habitua- 
tion is evaluated in terms of variations 1? 
amplitude within-subjects rather than mM 
terms of the absolute amplitude of responses: 
The use of within-subject changes eliminates 
many of the problems of absolute measures, 
such as variations in sweat gland density, 
area of contact, etc., as most of these vari- 
ables are constant for a given subject. As à 
result, intrasubject changes in response ampli- 
tude more accurately reflect changes in swea 
gland activity and, therefore, sympathetic 
nervous system activity, than do absolute 
measures. 


Summary and Conclusions 
Р s re- 
Measures of frequency of spontaneous 


Е M Pa cos nel. 0 
sponses yield consistently higher a 

; А ә а 
response for schizophrenics whether my 


А a : г -nsteln 
sured during minimal stimulation ( Bernstel?: 


1967; Fowles et al, 1970; Zahn, 1 
Zahn et al., 1968) or during more w 


stimulation (Jurko et al., 1952; Syz, ties 
Syz & Kinder, 1928). In contrast, the € 
showing poor habituation of schizophrenic 
using either tonic levels or phasic respo Б 
amplitudes have involved а шїї К 
stimulation (Bernstein, 1967; Fowles 2 ab 
1970; Israel, 1966; Zahn, 1964; Zahn €" "Y 
1968). Thus it appears that measures 


: f as 
intrasubject changes in tonic levels and Pi ۴ 
mo: 


whereas 


not been established, but it is likely #006 
measures of level and response pdt F> 
become less effective as arousal iu. 
cause of a ceiling effect or other limita M 
the peripheral mechanism producing e 
dermal phenomena. mM 

Evidence has already been cited BED ive 
that spontaneous fluctuations are tle n 
of arousal. The finding of poor habitu 


ing 


NT 
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among anxiety neurotics (Lader & Wing, 
1966) suggests that the poor habituation of 
schizophrenics is also a result of overarousal. 
Indeed, a large number of studies have shown 
that resistance to habituation and fre- 
quency of spontaneous responses are corre- 
lated (ranging from .40 to .74), suggesting 
that they may be viewed as alternate indi- 
cants of the same underlying dimension 
(Crider & Lunn, 1971). These results with 
Schizophrenics, then, are consistent with those 
Obtained with cardiovascular activity (Lang 
& Buss, 1965) and extend the findings of 
Overarousal in schizophrenia to a second 
response system within the autonomic nervous 
System, 

These conclusions apply to average results 
With chronic schizophrenics, and especially to 
honparanoid chronic schizophrenics as defined 
by Venables (1964). It is also likely that there 
are other subgroups among chronic schizo- 
Dhrenics to which these conclusions do not 
apply, There is already suggestive evi- 
dence that active schizophrenics as assessed 
(V the Venables’ activity- vithdrawal scale 

enable А raroused. The 
ee سا‎ been re- 
lated t cipe sepan ; 
eed to arousal, active patients hang at I 
196 nd of the arousal continuum (Venables, 
T TU Venables & Wing, 1962). More- 
ver, these patients have been found to ex 
Ibit inferior performance relative to normal 
and neurotic controls on an information- 
Processing task under standard conditions but 
Not when arousal was induced in all subjects 
y Means of induced muscle tension.* Thus 
Sir performance was actually improved by 
Few arousal. Similarly, Venables m 
ductio that active schizophrenics showe a те- 
sponses iH the frequency of orienting Te- 
tion мн ап їпсгеазе їп background stimu- 
Sc izo ы Seems unlikely, therefore, that active 
ata is are overaroused. Further T 
пе э of this group of patients woul 
arousa]. clarify the relationship l 
ified д and various types of behavior clas- 
S schizophrenic. 
ere is no reason to believ: 


in 


e that chronic 


ceptual or- 


4 
withdrawn 


anias Depue, and D. C. Fowles. Con 
Schizophr and arousal in active and 
renics, Submitted for publication. 
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schizophrenics are more aroused than are 
anxiety neurotics. Indeed a review of studies 
of anxiety neurotics would reveal results very 
similar to those reported for chronic schizo- 
phrenics. There must be an additional factor, 
probably some type of cognitive deficit, which 
contributes to schizophrenia in addition to 
overarousal. Patients who suffer from both 
chronic arousal and the cognitive deficit are 
likely to become chronic. This suggestion 
resembles the type of model proposed by 
Broen (1968), in which arousal interacts with 
a cognitive deficit (lowered maximum re- 
sponse strength ceiling) to produce the schizo- 
phrenic behavioral deficit. The cause of over- 
arousal is a purely speculative matter at this 
point. The comparison with anxiety neurotics, 
however, suggests that it may be caused by 
an inhibitory deficit, as suggested by Malmo 
(1959). 
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THE DEVIL RIDES AGAIN: 


CORRELATION AS AN INDEX OF FIT? 


MICHAEL H. BIRNBAUM ? 
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Correlations between theoretical predictions and data can be higher for in- 


correct than for correct models 
hypothetical data 

studies that use cor 
vides a sounder b 
of model fitting 


and by testing 


as illustrated by analyses of two sets of 
questions about the conclusions of recent 
ition as an index of fit. Functional measurement pro- 
for model evaluation by placing scaling in the context 
deviations 


from prediction rather than 


concentrating an overall goodness of fit, 


p widely recognized as an “instru- 
Чой Ито the devil? when used to infer causa- 
nc confounded data, the correlation 
it of pw s still employed as ап index of the 
Оц ена, models. There are two seri- 
models tclsms of this usage. First, incorrect 

А highly with data 
& Torgerson, 
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(Anders : 
nee 1971; Yntema 


de based upon the assumption that 
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Specia |, cues аз income and the opportunity to use 
fi interests, The multiplicative model yielded 
the Correlations than the linear model for 32 ot 
Tine; Subjects, Goldberg (1971) found that the 
culti jode yielded higher correlations {һап the 
үт we for each ‘of 29 clinicians, who at- 
P si t differentiate. neurosis from psychosis on 
P S of the Minnesota Multiphasic Personality 


ТУ profiles. 


The point of the present paper is to warn 
that the devil may also be at work when cor- 
relation is used in this way: Predictions from 
an incorrect model sometimes correlate better 
with the data than predictions from the 
correct model. 


LINEAR MODEL CORRELATES WITH 
MULTIPLICATIVE DATA 


Table 1 illustrates one way this can 
happen. The matrix of hypothetical, errorless 
data might represent clinical ratings of the 
degree of neuroticism, 1, based upon two test 
scores, Х and Xs, with eight and three levels, 
respectively. 

The usual Zigear model can be written as 
follows: 

[1] 
where Y is the predicted value for the data 
in the table, X; and Х are the a priori values 
of the two predictor variables (in this case, 
test scores), and a, b, and c are the linear 


coefficients. 
The multiplicative model (Einhorn, 1970) 


Y =a + bX + eX», 


can be written * 
[2] 


where F, Xi, and X» are defined as above, 
and a, b, and c are constants. Taking loga- 


4 Einhorn (1970) has defined Equation 2 as the 
“conjunctive model.” Equation 2 captures some of 
the intuition behind the notion of a subjective 
“conjunctive” strategy: however, the multiplicative 
model (Equation 2) is compensatory and therefore 
should not be confused with the traditional defini- 
tion of the conjunctive model (see Coombs, 1964). 
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TABLE 1 TABLE 2 
HYPOTHETICAL MULTIPLICATIVE DATA HYPOTHETICAL ADDITIVE DATA 
A A priori value of first cue A priori A priori value of first cue 
priori value of — 
value of — CO. second | D | | 
second 1 cue | a 2 + | ж | 16 
cue i Е: TETE 6 7 8 " = - - 
| 1 mH | ds i6 | 17 | 18 
1 | 15 (18 21 | 22 | 27 | 30 | 33 | 36 2 15 16 17 18 19 
2 |2122) 24 | 26 | 28 | 30 | 32 | 34 a | 4% 17 18 19 | 20 
3 | 25 | 36 | 27 | 28 | 20 | 30 | 31,32 8 17 | 48 19 30) | 2i 
ft : — 16 18 19 | 20 21 | 22 
rithms of both sides of Equation 2 gives: | | 


log У = log a + blog X; + clog Xs, [3] 


which, like Equation 1, can be fitted by mul- 
tiple linear regression with three constants. 
The fit of Equation 1 to the data in 
Table 1 yielded a correlation of .930. Figure 
1A plots these data against the best-fit pre- 
dicted values in the usual way; the fit looks 
reasonably good. However, Figure 1B shows 
the same data plotted as a function of Xj, 
With a separate curve for each level of X». 
The nonparallelism of these curves indicates 
serious violation of the additive model of 
Equation 1. It should be clear that the data 
conform instead to a multiplicative model. 
Einhorn's technique would fit his multipli- 
cative model to the data by means of Equa- 
tion 3, which yields a coefficient of only .899, 
This example shows that even when the data 


are perfectly multiplicative, comparison of cor- 
relation coefficients can lead to the erroneus 
conclusion that the linear model provides the 
better representation. The source of this dif- 
ficulty lies in the assumption, implicit in the 
use of Equation 3, that the a priori values 
of X and Y are positive and known to a 
ratio scale. As shown below, the data of 
Table 1 fit Equation 2 perfectly when linear 
transformations are applied to the a priori 
scales. 


MULTIPLICATIVE MODEL CORRELATES WITH 
ADDITIVE DATA 


A second example shows that predictions 
Irom the multiplicative model can correlate 
better than those from the linear model, even 
when the linear model is more appropriate. 
The hypothetical data in Table 2 might be 
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Fic. 2. A—Correlation of multiplicative model with perfectly additive data 


of Table 2. (Graph points represent varying numbers of data points.) B—Same 
data plotted as a function of Xi, with a separate curve for each level of Xs. 
lism of the curves in B indicates perfect additivity, the 
ior fit because it requires the curves to be 


(Although the paralle 
linear model provides an infer 


straight-line functions of the a priori values of Yi: and Xs.) 


ratings, Y, of the attractiveness of various 
Jobs based on two cues, X; and X». 

Fitting the linear model to these data 
yields a correlation of only .933 compared 
with .998 for the multiplicative model. It can 
be inferred from Figure 2A, which plots the 
fit of Equation 3 to the data, that the use 
Of rank-order correlation (Einhorn, 1970, 
1971) would have “improved” on the already 
excellent fit of the wrong model. The multi- 
Dlicative model is the wrong model because 
the data are perfectly additive, as shown by 
the parallelism of the curves of Figure 2B. 

he investigator using correlation as an index 
Be E might erroneously conclude that the 
data reveal a “conjunctive” judgmental 
Strategy unless the data were plotted as in 
Figure 2B. It is only because the linear 
regression model requires the data to plot as 

inear functions of the a priori values of X: 
and X, that it yields a lower correlation. 


FUNCTIONAL MEASUREMENT PROVIDES 
BETTER TEST 


Functional measurement (Anderson, 1970, 
) correctly diagnoses the same tipoti: 
th data. The linear model is additive E 

erefore predicts а zero interaction between 
the Cues. Equation 2 is multiplicative e 

*refore predicts an interaction which should 
€ located entirely in the pilinear component. 


These interactions can be tested graphically, 
as in Figure 1B and 2B, and statistically, 
using the analysis of variance. 

The intersecting linear curves of Figure 1B 
are indicative of a multiplicative model with 
a functional zero at the point where the 
curves intersect. When Y is replaced by 
Y 30, X, is replaced by X; — 6, Xs is 
replaced by 4— Xs, and a—5-—c-— ik, 
Equation 2 fits the data of Figure 1B 
perfectly. 

The parallelism of the curves of Figure 2B 
indicates that there is no interaction between 
cues, in agreement with an additive model. 
In the case of additive models, the marginal 
means estimate the functional scale values for 
the cues. Equation 1 fits these data perfectly 
when the marginal means of Table 2 are 
substituted for the a priori values of X. 


COMMENTS AND CONCLUSIONS 


These two examples demonstrate that even 
with factorial designs and errorless, interval 
data, the correlation coefficient can be an 
inappropriate instrument for comparing the 
fit of models with equal numbers of param- 
eters, In real-life applications, correlations 
may be even more misleading. 

Correlations can be diabolical when fac- 
torial designs are not employed. The reader 


can easily select those cue combinations from 
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Figure 1B and 2B that would improve the 
fit of either model, Furthermore, with certain 
research designs, graphic tests like those 
shown in Figures 1B and 2B become impos- 
sible. Hence, "representative" or improper 
"contrived" designs can exaggerate the fit of 
a seriously defective model and also can make 
it impossible to evaluate. 

The data need not be particularly unusual 
for the correlations to be so misleading. In 
Table 2, the functional scale values (marginal 
means) are linearly related to the logarithms 
of the a priori stimulus values. That is often 
the case for category ratings of psycho- 
physical stimuli or stimuli presented in 
numerical íorm, such as incomes. Conse- 
quently, there is a very real danger that the 
multiplicative model would yield higher cor- 
relations than the linear in many applications, 
even though the data were additive. 

In contrast to correlational procedures, 
functional measurement (Anderson, 1970, 
1971) requires neither a priori values for the 
stimuli, nor ratio scales for the responses. 


5 Goldberg (1971) has proposed that some of these 
problems would be avoided if a greater number of 
models were compared in any correlational contest 
of fit. For example, Goldberg's logarithmic model, 
which replaces X with log X in Equation 1, would 
have correlated perfectly with the data of Table 2. 
However, none of his five models would have cor- 
rectly diagnosed the data of Table 1. The number 
of possible correlation-regression models is impracti- 
cally large, since each arbitrary rescaling of either 
the a priori stimulus values or the overt responses 
is treated as a different model. It should be empha- 
sized that the usual applications of multiple regres- 
sion (as in the present examples) are not equivalent 
to the analysis of variance. Equivalent techniques 
(e.g., Cohen, 1968) are seldom employed in this type 
of analysis because they would be cumbersome 
to apply. 

® Functional measurement also provides for mono- 
tonic rescaling of the data under certain circum- 
stances although no nonlinear transformation was 
called for in these examples. When the dependent 
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These procedures estimate best-fit values for 
the stimuli directly from the data, using these 
estimates as the basis for statistical tests of 
the discrepancies. Functional measurement 
typically employs factorial designs, analysis 
of variance, and graphical inspection of the 
crucial features of the data. These advantages 
of functional measurement over the correla- 
tional approach are illustrated by the sample 
data in the present paper. 


variable is considered to be only an ordinal mea- 
sure of the psychological attribute, models can be 
compared by examination of critical ordinal proper- 
ties of the data (Krantz & Tversky, 1971). For 
example, the crossover interaction in Figure 1B is 
ordinally inconsistent with an additive representation. 
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SHOCK-MOTIVATED AVOIDANCE AND THE 
ANALYSIS OF BEHAVIOR’ 


DAVID S. OLTON ? 
Johns Hopkins University 


Avoidance tasks are distinguished by their spatial characteristics and by 


whether 
i for a resulting cl 


one-way active a 
used to discuss (а) 
(b) cue utilization and species differences in 


changes following brain lesions, 
gous brain structures, (e) theoretical 
system functioning. The system is u 
summarizing rel 
alternative explanations of results. 


E he proliferation of shock-motivated test- 
Ed eaque has produced many apparently 
бы epant results. Resolution of these dis- 
-repancies, based on consideration of spatial 
e ре characteristics, has been of- 
АТ, but only within a specific theoretical 
МОву, 1966), experimental (Olton & 
ан, 1968a; Theios, Lynch, & Lowe, 
EROR), or a neurophysiological (Olton & 
"ewm 1968b) framework. Consequently, 
wir Ue of. this approach has not been 
БАНИ recognized. This paper reviews the 
buses and response distinctions mentioned 
and à emphasizing their general applicability 
tizing la icating their usefulness in summa- 
ing 8 large bodies of data and in distinguish- 
among alternative theoretical explanations 
9f results, 
EU first major distinction to be drawn 
Ng avoidance tasks is that between active 
E passive avoidance (McCleary, 1961; 
"'OWrer, 1960). In the active avoidance 
шт, the animal must emit а particular 
us 486 to escape ог avoid a noxious stimu- 
» While in the passive avoidance paradigm 
— animal must not emit a particular re- 
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tively large bodies of data, 


ts 


response initiation or response suppression is required. Justification 
assification system composed of passive avoidance, 
‘oidance, two-way active avoidance, etc. This system is then 
the interrelationship of motivation and performance, 


cue utilization, (c) behavioral 


(d) comparison of behavioral roles of homolo- 
1 and conceptual models of central nervous 
seful in resolving apparent discrepancies, 


and distinguishing among 


sponse. In principle, the appropriate time for 
the response may or may not be signaled by a 
discriminative stimulus. In practice, active 
avoidance tasks almost always utilize dis- 
criminative stimuli while passive avoidance 
tasks do not and unless otherwise noted, this 
convention is followed throughout the discus- 
sion that follows. 

In the typical active avoidance task, a 
conditioned stimulus (CS) is presented in the 
presence of which the animal must emit some 
well-defined low-probability response such as 
a bar, jumping over a barrier, climb- 
ing a pole, etc., in order to avoid shock. 
Measures include the number of trials re- 
quired to perform the avoidance response 
reliably and the latency of those responses. 
In the typical passive avoidance task, condi- 
tions are arranged so that the animal has a 
high probability of emitting a response. This 
result from training or from a species- 


pressing 


may 
typical response such as stepping off a small 
platform, entering a dark compartment, 


leaving a shaking bar, etc. At some time arbi- 
trarily chosen by the experimenter, this re- 
sponse is punished. Passive avoidance mea- 
sures include a decrease in the probability 
and an increase in the latency of response. 
The terms “active” and “passive” are used 
only to define the procedures just described; 
no other connotations are intended, Also, these 
terms refer only to the experimenter-defined 
response. During any avoidance training a 
variety of behaviors are observed. In passive 
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avoidance the animal rarely does nothing as 
the term might imply; rather, it engages in 
a series of approach-avoidance and displace- 
ment activities, Similarly, in active avoidance 
the animal does not just emit the correct 
response but suppresses some (such as 
freezing) and engages in others (such as ex- 
ploration). In short, the terms “active avoid- 
ance" and "passive avoidance" do not com- 
pletely describe the animal's behavior. None- 
theless, it is difficult to find more appropriate 
terms, and this paper will continue to use 
the active-passive terminology. 

A further division of active avoidance tasks 
into one-way active avoidance and two-way 
active avoidance completes the classification 
System. In one-way active avoidance the ani- 
mal responds in only one direction, leaving a 
consistently noxious location and entering a 
consistently safe location. In two-way active 
avoidance, there are no consistently safe and 
noxious locations. The animal must learn to 
respond in two directions, leaving one location 
and entering the other on one trial, and leav- 
ing the latter and entering the former on the 
next trial, " 

The one-way, t -way classification re- 
quires that an avoidance task have exactly 
two locations; not all avoidance tasks do. 
There may be only one location as in a bar- 
press avoidance task, or there may be several 
locations, as in a sequential avoidance task. 
Although both types of tasks may be ana- 
lyzed in terms of one-way and two-way com- 
ponents (see p. 245), neither fits into the 
strict categorization given above and conse- 
qnently is place into an “other” category. 


MOTIVATION AND PERFORMANCE: 
STIMULUS INTENSITY 


Tn the present context, the general problem 
concerning the interrelationship between in- 
tensity of motivation and efficiency of learn- 
ing or performance becomes: How does 
variation of shock intensity affect the rat's 


316 is suggested that the term "shuttle box" be 
discontinued because of the confusion between 
"shuttle box avoidance" and "shuttle box apparatus." 
The former refers to a two-way procedure while the 
latter refers to a two-compartment apparatus in 
Which either one-way or two-way aclive avoidance 
may be conducted, 
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avoidance behavior? In both one-way active 
avoidance and passive avoidance, the rate of 
acquisition increases or does not change as 
shock intensity is raised from .5 milli- 
ampere to 3.0 milliamperes (Kurtz & Pearl, 
1960; Moyer & Korn, 1966; Theios et al., 
1966). In two-way active avoidance, just 
the opposite is true. With shock intensities 
greater than .5 milliampere, the rate of 
acquisition of two-way active avoidance 
steadily declines as shock intensity rises 
(Beatty & Beatty, 1970; Davis, Mook, & 
Brush, 1958; Johnson & Church, 1965; 
Kurtz & Shafer, 1967; Levine, 1966; Martin, 
Powell, & Kamano, 1967; McAllister & 
McAllister, 1971; Moyer & Korn, 1964; 
Overmier & Seligman, 1967; Theios et al, 
1966). 

An appropriate post hoc explanation for 
the differences just described lies in the con- 
flict characteristics of the two tasks. The 
one-way task is conflict free. One location 15 
always noxious, the other is always neutral. 
Raising shock intensity ought to increase the 
aversive character of the former while 
leaving the neutral character of the latter un- 
affected. The animal should be more likely 
to move from the shocked to the unshocked 
location, The two-way task is not conflict 
free. In order to avoid shock on any give? 
trial, the animal must return to a location 1 
which it was shocked on the preceding trial. 
The resulting outcome is an avoidance" 
avoidance conflict. Raising the shock inten 
Sity ought to increase the conflict, a Lg 
cedure which often produces maladapt 
stereotyped behaviors which interfere wit? 
the correct response. Other analyses, in -— 
detail, have been presented by McAllister anc 
McAllister (1971) and McCleary (1966). 

At extremely high intensities of shock, a 
monotonic interrelationship between shock e 
tensity and rate of acquisition may e 
down. Dogs tested in two-way active pe 
ance with "subtetanizing" (5-10 ™ Я 
amperes) levels of shock (Solomon & Wy 
1953) attained criterion in very few d 
Although there is no comparable pa 
avoidance experiment, an inversion of 
monotonic trend might be found here also: 
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Fic. 1. Schematic representation of the four avoidance procedures discussed 
in Olton and Isaacson (1968a). (The arrows indicate direction of movement 
and the shading differentiates the two compartments. Note that in the sequen- 
tial avoidance procedure the white compartment, containing the animal, is not 


moved during the intertrial interval.) 


Cur UTILIZATION AND ATTENTION 


Е зао reaction to certain stimulus 
Саас „апі species differences in this 
сїй, b eristic are two other problems which 
Way е dealt with by consideration of one- 
this and two-way active avoidance. Up to 
cate on no attempt has been made to indi- 
bcati e characteristics distinguishing one 
tant % from another. This issue is impor- 
about ecause (a) it provides information 
t cue preferences of different species and 
ae determines the classification of a par- 
ae ee as either one-way or two- 
= Ive avoidance. 
га. Чу dealing directly with this issue 
avoida ed performance in four different active 
nee tasks (Olton & Isaacson, 1968a). 
- ed and two-way procedures were con- 
remai as usual with the two compartments 
aming in their respective spatial locations. 
d the cues for spatial location and com- 
ie Т identification were opposed. In 
(cy partent diiit avoidance procedure 
of hich P 1966) the compartments „(опе 
с айрый. contained the animal) were inter- 
Which p at the end of the intertrial interval 
With те esulted in one-way active avoidance 
üctive Spect to spatial location but two-way 
Ments | Sige with respect to compart- 
e co n the sequential avoidance procedure, 
aheaq ™Mpartment just electrified was moved 
ani the compartment containing the 
his task was again two-way with 
compartmental cues, but differed 
Oth the one-way and two-way tasks 


with respect to spatial cues. Figure 1 illus- 
trates these procedures. 

Results indicated that performance in 
tasks spatially equivalent (compartment shift 
and one way) was similar, while performance 
in tasks spatially distinct (compartment 
shift, sequential, and two way) was dif- 
fent. Although compartmental cues are not 
irrelevant in avoidance performance (Denny, 
Koons, & Mason, 1959), the data suggest 
that rats preferentially associate shock with 
a given location rather than with coextensive 
visual and auditory stimuli. 

Are other species as dependent upon spatial 
cues in the performance of avoidance tasks? 
For cats, the number of trials to criterion in 
compartment-shift avoidance is almost as 
large as that observed in two-way active 
avoidance (Lubar & Perachio, 1965; Mc- 
Cleary, 1966). These data suggest that the 
cat treats the compartment-shift task as dif- 
ferent from one-way active avoidance and 
similar to two-way active avoidance. A pos- 
sible complication lies in the method used to 
move the compartments (Kriekhaus, 1965) 
but other experiments (Baum & Bobrow, 
1966) suggests that this may not be a con- 
sideration. 

Additional data come from a comparison 
of the normal two-way active avoidance pro- 
cedure with ones in which the location of the 
conditioned stimulus (CS) is changed from 
trial to trial so that the task is two way with 
respect to spatial location and compartment 
cue but one way with respect to the CS. The 
performance of cats is substantially affected 


by having an auditory CS presented in this 
one-way fashion (McAdam, 1964) while the 
performance of rats is not (Whittleton, Kos- 
tanek, & Sawrey, 1965). Again the data sug- 
gest that cats do not preferentially attend to 
cues regarding spatial location. Monkeys 
probably also attend to auditory and visual 
cues in such a procedure, rather than to 
spatial ones (Krieckhaus, 1967). 

In summary, there are a variety of cues 
attended to during avoidance training (cf. 
Rozin and Kalat, 1971) and these vary from 
species to species. More direct tests concern- 
ing the importance of various sensory modali- 
ties in avoidance learning in animals other 
than the rat would be helpful for under- 
standing the behavior of the species in ques- 
tion, and for assistance in categorizing a 
particular avoidance procedure as either 
one-way or two-way active avoidance. 


BRAIN LESIONS AND AVOIDANCE BEHAVIOR 


Early studies (Papez, 1937) correctly 
suggested that the neuroanatomical bases of 
emotional behavior were in subcortical, espe- 
cially limbic structures, but it was only with 
subsequent testing in shock-motivated avoid- 
ance procedures that finer distinctions could 
be made. One well-known study (McCleary, 
1961) demonstrated that septal lesions im- 
paired passive avoidance but not active 
avoidance, while cingulate lesions impaired 
active avoidance but not passive avoidance. 
Other experiments have corroborated these 
distinctions for the two brain areas men- 
tioned and have extended them to many other 
subcortical structures. As will be indicated, 
these results provide additional support for 
the classification of avoidance tasks pre- 
sented here and allow finer distinctions to be 
drawn among the behavioral roles of the 
neuroanatomical substrates of emotion. 

Much of the material relevant to this issue 
has been covered in detail in recent review 
articles, Consequently, in the ensuing discus- 
sion reference is made to review articles 
rather than to original sources whenever pos- 
sible. Except where indicated, only data from 
rats is considered, 

There is little systematic information con- 
cerning the results of neocortical lesions on 
avoidance behavior. With small lesions of 
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the posterior cortex, rats appear to be 
slightly impaired in the acquisition and re- 
acquisition of both one-way and two-way 
active avoidance tasks, exhibiting longer re- 
sponse latencies (Olton & Isaacson, 1968b; 
Saavedra, Pinto Hamuy, & Oberti, 1965). 
Passive avoidance is probably unaffected 
(Isaacson, Olton, Bauer & Swart, 1966; Tsaac- 
son & Wickelgren, 1962; Kimble, 1963; Kim- 
ble, Kirkby, & Stein, 1966; Olton & Isaacson, 
1968b). In the cat, lesions of the lateral gyrus, 
posterolateral gyrus, and occipital cortex 1m- 
pair acquisition of two-way active avoidance 
(Lubar & Middaugh, 1971; Lubar, Perachio, 
& Kavanagh, 1966; Lubar, Schastal, & 
Perachio, 1967). 

Lesions in the dorsomedial nucleus of te 
thalamus produce an impairment in acquist- 
tion of both one-way and two-way active 
avoidance, but have no effect upon acquisition 
of passive avoidance. Postoperative retention 
in the absence of shock is unaffected E 
& Isaacson, 1967). This suggests that ins 
behavioral change is in response to the п 
ceptive somatic stimulus (pain) rather ү, 
to an internal representation (fear). For if 
review of this literature, see Vanderwo 
(1971). 

Other thalamic nuclei are also involve“ is 
avoidance behavior. Small lesions of the pe 
line nuclei impair acquisition of O 
active avoidance (Bohus & de Weid, 960 
and reacquisition of one-way 
ance, Thompson, 1963) but escape 
remains unchanged, After desane ey А 
parafascicularis nucleus, rats are de unim- 
acquistion of one-way avoidance, but sull 
paired in acquisition of two-way active t 
ance (Bohus & de Weid, 1967). 1651075 ye 
the anterior thalamic nuclei, part cqui- 
original Papez “circuit,” interfere s etr | 
sition of passive avoidance (Ger way 
1965) but not with the retention of ОП 


1 in 


nucleus do not alter avoidance ORS) 
(Olton & Isaacson, 1967; Thompson, a ui- 
Hippocampal lesions in rats ише $ 
sition of two-way active avoidance es 
slightly impair or do not affect pacc ret in 
one-way active avoidance. An impair" fou” 
passive avoidance may or may not be 
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depending upon the characteristics of the 
task (Nadel, 1968). For reviews of this lit- 
erature, see Douglas (1967) and Olton and 
Isaacson (19681). 

Septal lesions in rats produce behavioral 
Changes similar to those found айег hippo- 
campal lesions. Acquisition of passive and 
one-way active avoidance is impaired while 
acquisition of two-way active avoidance is 
enhanced. In cats, septal lesions impair acqui- 
Sition of passive avoidance and enhance ac- 
quisition of compartment-shift avoidance. The 
changes in passive avoidance are independent 
of altered regulatory function and the changes 
in active avoidance are independent of the 
Septal hyperreactivity snydrome. For reviews, 
See McCleary (1966) and Thomas, Hostetter, 
& Barker (1968). | 

'The effects of amygdaloid lesions in rats 
Upon avoidance behavior are not clear, partly 
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Avoidance 
NEOCORTEX — 
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Anterior nucleus | 
Dorsomedial nucleus — 
Midline nuclei | 
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SEPTUM | 
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CAUDATE NUCLEUS | 


MAMMILLOTHALAMIC 
TRACT 


Fic. 2. Avoidance per 
greatly facilitated performance 
in performance =; slightly 
formance J. A solid arrow 
arrow indicates confusion. 


formance after brain 
1 eft slightly facilit 
impaired performance No: # 
indicates consistency in th 
No arrow indicates the absence © 


due to the lack of agreement upon appropri- 
ate subdivisions of the amygdaloid nuclei. 
There are no reports of facilitation of avoid- 
ance behavior after lesion or stimulation of 
the amygdala, while reports of deficits are 
common, Additional data are needed before 
a definite statement can be made about the 
specific pattern of performance changes ex- 
hibited on different avoidance tasks (see 
Goddard, 1964). 

Caudate lesions in rats impair acquisition 
of one-way active avoidance and of trained, 
but not untrained, passive avoidance (Kirkby 
& Kimble, 1968). In cats, caudate lesions 
impair passive avoidance (Fox, Kimble, & 
Lickey, 1964) and two-way active avoidance 
(Thompson, 1959; Thompson & Mettler, 
1963). 

Mammillothalamic tract lesions have little, 
if any, effect upon the acquisition of either 


One -way Two- way 
Active Active 
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passive avoidance, one-way active avoidance, 
or two-way active avoidance. Acquisition of 
more complex avoidance tasks, as well as 
retention of two-way active avoidance and 
of discriminated avoidance is impaired. For 
reviews, see Krieckhaus (1967). 

Figure 2 summarizes these data, illustrating 
what is known about the performance of 
animals with particular brain lesions, and 
what has not yet been learned, either because 
there are no experimental data or because 
the data are conflicting. 

Two conclusions may be drawn from these 
experiments, First, the behavioral effects of a 
particular central nervous system lesion vary 
depending upon the type of avoidance task. 
Second, the pattern of performance across all 
three types of avoidance tasks varies with 
the site of the lesion. These results indicate 
that the neuroanatomical mechanisms used 
during avoidance training differ from one 
type of avoidance task to another. These 
conclusions provide neuroanatomical support 
for the classification system proposed here 
and allow further distinctions to be drawn 
among the behavioral roles of different sub- 
cortical structures thought to be important in 
the elaboration of emotions. 
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Although further subdivision of the brain 
structures summarized is probably appropri- 
ate (Grant & Jarrard, 1968; Van Hosen, 
MacDougal, & Mitchell, 1969), there is not 
yet sufficient information about animals with 
more discrete lesions to include them 1n 
Figure 2. As more data become available, the 
same type of analysis applied to the larger 
structures should also be of value for 
investigation of their smaller subunits. 
Comparative Studies 

To determine whether or not homologous 
brain structures have equivalent behavioral 
functions, cross-species comparisons and cor- 
rect classification of avoidance tasks are гч 
quired. This procedure is made difficul 
because a given objective procedure may not 
represent the same kind of task for дее 
species. Rats treat compartment-shift avoit 
ance as if it were a one-way task; cats may 
or may not (see p. 246). Septal lesions a 
rats impair one-way active avoidance; зр 
lesions in cats do not affect compartment-shi x 
avoidance. If compartment-shift avoidance 
for cats is equivalent to the one-way er 
dure for rats, then the spetal nuclei in thes 
species do not have equivalent behaviora 
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ACTIVE ACTIVE 
AVOIDANCE AVOIDANCE 


^ A 


^ ۸ 
Y v 


and the patterns of performance that 
them. 
performance î, no change in performance —, an 


(The arrows indicate: enhanced 


d impaired performance 4.) 


| 


AVOIDANCE BEHAVIOR 


functions, If, on the other hand, the com- 
partment-shift procedure for cats is a type 
of hybrid between one-way and two-way 
avoidance, these homologous brain structures 
may have equivalent functions. Similar prob- 
lems arise in consideration of cross-species 
comparisons after lesions of the mammillo- 
thalamic tract (Krieckhaus, 1967) and the 
hypothalamus (Ingram, 1958; McAdam & 
Kaelber, 1966), and further experiments are 
necessary to settle the issue. 


THEORETICAL AND CONCEPTUAL MODELS 


The avoidance analysis presented here is 
useful in summarizing data from experiments 
Hsing brain lesions as well as offering sug- 
gestions as to the psychological processes that 
might have been altered as а result of the 
lesion. Some hypotheses and related perform- 
ance patterns are presented in Figure 3. The 
rationale for most of the patterns is clear. 
Increased activity ought to facilitate acqui- 
sition of both types of active avoidance be- 
Cause it should decrease the probability 
of maladaptive freezing responses; passive 
avoidance would be affected only if the activ- 
ity Was goal directed. Changes in response 
facilitation and response inhibition, used in 
Conceptual rather than descriptive context, 
ought to produce the patterns indicated. 

Iwo relationships that might not be obvi- 
OUS are: increased fear, two-way active avoid- 
ance impaired, and decreased fear, two-way 
active avoidance enhanced. Evidence has been 
Dresented (see p. 244) that increasing shock 
Intensity decreases the rate of acquisition of 
two-way active avoidance. If fear is posi- 
шу correlated with shock intensity; in- 
mre eri i dere 
fear oye ctive avoidance W g 

ught to accelerate it. 

x de hypotheses and patterns of perform- 
tend Eie: of course, possible. Figure 3 i = 
ж» | only to illustrate the advantage 4 he 
m proposed here. By equating all ex- 
Whi mental parameters except the manner in 

ich the shock is applied, undesirable arti- 
o may be eliminated. Both for the genera- 
in. new hypotheses and for E a 
Patten of already proposed acm Lar 
Offers п performance on these 5 

à rapid and reliable procedure. 
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When these theoretical patterns of perform- 
ance are compared with those actually ob- 
served after brain lesions, it is apparent that 
changes in activity are probably not the cause 
of avoidance changes following any brain 
lesions, although they may be observed as 
independent (Douglas, 1967) or dependent 
(Krieckhaus, 1967) effects. Increased fear 
seems unacceptable also. Decreased fear is 
compatible with the results of septal lesions 
(Kenyon & Krieckhaus, 1965; Lubar, 1964) 
and hippocampal lesions (Olton & Isaacson, 
1969). Decreased (voluntary) response facili- 
tation is compatible with the results of dorso- 
medial thalamic lesions (Vanderwolf, 1971), 
while a change in the ability to shift from 
stereotyped to novel responses has been pro- 
posed for the results observed after mam- 
millothalamic tract lesions (Krieckhaus, 
1967). Pending appropriate classification of 
the compartment-shift task for cats, either 
decreased response inhibition or decreased 
{еаг might be appropriate explanations for 
the behavioral changes observed aíter septal 


lesions. 
SUMMARY AND CONCLUSIONS 


Justification has been offered for classifica- 
tion of avoidance tasks into passive avoid- 
ance, one-way active avoidance, and two-way 
active avoidance. This analysis has been ap- 
plied to five different areas, and the following 
conclusions drawn: (a) the amount of conflict 
present in a task influences the relationship 
between intensity of motivation and rate of 
learning; (b) cues concerning spatial loca- 
tion in avoidance behavior are dominant in 
the rat, but probably not in the cat or 
monkey; (c) the functional role of sub- 
cortical structures thought to be involved in 
emotional behavior can be reliably differenti- 
ated; (d) appropriate classification of avoid- 
ance tasks is mandatory in determining 
whether homologous brain structures have 
equivalent behavioral functions; (e) theo- 
retical and conceptual models of central 
nervous system functioning can be verified 
by comparison of performance on these 
three tasks. 

Criticism has been made of experiments 
utilizing shock avoidance procedures because 
these behaviors are “unnatural” for animals 
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(Bolles, 1970). The evidence presented here 
suggests that this is not the case. Certainly 
an animal in the wild rarely, if ever, experi- 
ences electric shock, set intertrial intervals, 
etc, Nonetheless, the generality of the ap- 
proach proposed here indicates that appropri- 
ate shock motivated procedures utilize brain 
systems naturally designed to cope with 
aversive events. The distinctions drawn here 
among avoidance tasks appear to be signifi- 
cant both for the animal's behavior and for 
its central nervous system, suggesting that 
these procedures are appropriate for investi- 
gation of aversively motivated behaviors and 
for understanding the central nervous system 
mechanisms that underly them, 
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THE BOGUS PIPELIN 


A NEW IGN. 


THOMAS M. OSTROM ? 


Ohio State University 


The bogus pipeline has been offered by Jones and Si an i 
on previous rating procedures. Four areas of possible superiority were ex- 
amined to determine where the pipeline’s strengths and weaknesses lie. The 
pipeline was compared to the conventional rating scale in terms of relative 
sensitivity, functional comparability, ethical character, and practicality. It was 
concluded that, with the possible exception of improved sensitivity, the pipe- 
line’s superiority was not proven on the basis of the evidence offered. 


At night over swampy ground there occa- 
sionally appears a phosphorescent light that 
behaves in a mysterious manner, Sometimes 
it hovers and sometimes it flits about; and 
when one approaches it, the light just seems 
to disappear. The usual explanation of this 
mischievous light is that it results from spon- 
taneous combustion of swamp gas generated 
by rotting organic matter. Since this light 
would occasionally mislead travelers, it came 
to be known as ignis fatuus, a medieval Latin 
term that translates to “foolish fire.” 

The history of attitude measurement in 
social psychology has a similar will-o’-the-wisp 
character, Just when it seems a satisfactory 
technique is within our grasp, it manages to 
elude us, Flaws are invariably found and the 
search begins anew. In pre-Thurstone days 
(prior to 1928) many believed that any at- 
tempt to quantify attitudes was futile. Vet 
the need was great and Thurstone, encounter- 
ing only slight resistance, convinced research- 
ers that attitudes can be measured. But satis- 
faction with these techniques was short lived, 
for just as experimental attitude research 
was getting under way at the end of World 
War II, doubts began to emerge regarding 
their fallibility. Early concerns with the possi- 
bility of distortion and dissimulation in verbal 
responses has today become a preoccupation 
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all as an improvement 


in attitude measurement (Cook & E 
1964; Rosenthal & Rosnow, 1969). ona 
techniques, even nonverbal ones such as ae 
pillary response, have had an even П 
ephemeral existence. F 
P ne of the simplest and most commonly 
employed attitude measurement procec p? 
is the self-rating scale. This procedure aost 
been singled out as the one technique p 
vulnerable to bias and error (Cook & Se es 
1964). A new rating scale procedure p 
cently been proposed by Jones and REUS 
(1971) that, through ingenious develop! ЫШ 
of format and instructions, seemingly dime 
solution to some of the more serious wd т 
biases, So promising do Jones and Pg have 
gard this self-rating procedure that pey tha 
termed it the “bogus pipeline,” a tem direc 
suggests the procedure might provide : 
and unobstructed pipeline to the 5017. Jine 15 
The basic format of the bogus " - 
comparable to most self-rating mE with 
jects indicate their attitude on а ae able oF 
end labels such as favorable-unfavora rating 
agree-disagree. Rather than making his whee 
by pencil, the subject turns a steerin* 
that controls a pointer on the rà 
which is mounted on a console in 
him, The crucial difference between ү 3B 
the typical procedure is the set which а Fiat 
brings to the rating task. He 1s 


ting 5€ 

front 

his a” 

told t 

3 » h 
machine exists (the electromyograp! 4 
capable of measuring implicit um S 
tials when attached by electrodes pands 
son’s forearms. When seated with D 
the steering wheel, the subject's usn 
to turn the wheel to a particular in ine, 
gory could be detected by the mach 

252 


COMMENTARY ON THE BOGUS PIPELINE 


when the wheel itself is locked in place. By 
analyzing these implicit muscle potentials the 
machine can determine the subject's true atti- 
tude on the issue. 

Although this cock-and-bull story is ap- 
parently greeted with skepticism by most sub- 
jects, a set of deceptive demonstrations is 
presented to convince the subjects that the 
machine indeed does accurately record their 
true attitudinal inclinations. This is accomp- 
lished by obtaining the subject's self-rating 
cum variety of topics ahead of time in a 
ee arouses no suspicion and then 
Lud an accomplice use this information to 

ntrol the machine's feedback meter when 
the subject is seated before it. 

Tage is of the machine's infallibil- 
Жу jects are told that they are being 
бу, Ar ues how well they understand their 
ih EM “whether they were In touch 
хага е they felt.” They are asked to esti- 
ed neir own attitude by using the steering 
from, while the feedback meter is concealed 
am their view, Their self-ratings are to be 

pared with their true feelings as measured 

y the machine. 
is а bogus pipeline is not a test as that term 
ime used in attitude and personality 
ane i ee Rather, it is a response format 
ployed 1 instructional set that could be em- 
sed in a variety of such tests. It could be 
аш е the agree-disagree scale for Likert 
SB ide tests or for such scales as the Minne- 

Multiphasic Personality Inventory 


EMI) and F tests. The pipeline is amena- 


bl 

€ to a ite 

ble to a forced choice format such as 15 used 
ld be used 


i 

e Rotter I-E scale, or it cou Е 

man ле bipolar ratings required by the Se- 

m Differential. 

ER pipeline to be adopted аз а standard 

Comp; for the self-rating of attitudes, its 

is BI with (or superiority over) ex- 

Offere methods must be established. The data 
ed to support its relative merit will be 


dise 
1 В 
15е in the next several sections. 
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ing procedures, it seems of prior interest to 
establish the concordance of the two ap- 
proaches. 

Convergence of the pipeline with ordinary 
ratings can be established in two ways. Given 
that they measure, in part, the same construct 
people who score high on one should tend to 
score high on the other. Second, situational 
influences that produce a change for one 
should, if strong enough, lead to change in 
the other. Convergence across individual dif- 
ferences was shown in a study by Jones, Bell, 
and Aronson (1972). They correlated attrac- 
tion toward a confederate as expressed by the 
pipeline with a 12-item paper-and-pencil rat- 
ing scale. The within-cell correlations from 
their two experiments were .66 and .81, a 
satisfactory level of convergence. 

Jones et al. (1972) manipulated whether 
the confederate had expressed liking or dis- 
liking toward the subject, a variable known 
to reliably affect attraction. This situational 
variable significantly affected both the pipe- 
line and the rating measures of attraction in 
both experiments. As would be minimally ex- 
pected then, these gross situational and indi- 
vidual differences showed a similar pattern 
for each of the two measurement procedures. 

One argument for preferring a newly intro- 
duced attitude measure over existing measures 
is that it contains less error variance in the 
estimate it provides of each person’s attitude. 
The availability of a more sensitive measure 


would permit the researcher to detect subtle 


effects that might otherwise go unrecognized. 
tandard effects, 


And, when studying more 5 
investigators could use smaller sample sizes if 
the measure were known to be less error rid- 


den. 

Several studies were reviewed by Jones and 
Sigall that indicated the pipeline’s sensitivity 
might be superior to the rating scale. In one 
paper (Jones et al, 1972), a predicted inter- 
action (between what variables is immaterial 
for present purposes) was tested in two ex- 
periments. Jones and Sigall suggested that the 
pipeline provided a truer portrayal of the sub- 
jects’ feelings in these studies because the 
pattern of means for the pipeline was closer 
to that predicted than was the pattern of 
rating measures. But since the predicted in- 
teraction did not reach statistical significance 


ior either measure in either experiment, this 
evidence cannot be given much weight. 
Data of a more convincing nature were ob- 
tained by Cooper (1971). A dissonance pre- 
diction involving a three-way interaction was 
confirmed when using the pipeline procedure 
but was not significantly supported when us- 
ing paper-and-pencil ratings, But as Jones 
and Sigall noted, there was a possibility of 
the ratings being influenced by their having 
lowed the pipeline in the sequence of data 
ection. A preferable test of differential 
* sensitivity would involve half the subjects 
| receiving one dependent measure and half the 
other measure. 
just such a test was undertaken in an un- 
published study by Sigall and Page that was 
described in Jones and Sigall (1971, p. 358). 
Subjects evaluated an obnoxious person who 
was either physically normal or wore leg 
braces (stigmatized). Half of each group used 
the pipeline and half used a comparable rat- 
ing procedure. This stigma manipulation did 
not produce a significant main effect nor did 
it interact with the nature of the dependent 
variable when looking at overall attraction. In 
terms of detecting differences due to stigma, 
both measures agreed that no significant dií- 
ferences existed. It appears that neither this 
study nor the two previously described offer 
conclusive support for the proposition that 
the pipeline is more sensitive to attitudinal 
influences than is the ordinary rating scale. 
One potential advantage to the pipeline 
was shown in this Sigall and Page study. 
Whereas subjects were reluctant to give an 
obnoxious person a negative evaluation when 
using the rating scale, this positivity bias 
disappeared when the pipeline was used. Al- 
though this difference in scale origin may be 
important for determining the absolute num- 
ber of pro- and antirespondents, it is irrele- 
vant when testing relational hypotheses. 


FUNCTIONAL COMPARABILITY 


Should the pipeline be shown to possess 
greater sensitivity than a comparable rating 
scale, this would not constitute an argument 
against using the rating scale. It would only 
mean that larger sample sizes and stronger 
manipulations would be required with rating 
procedures than with the pipeline. | 
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But the pipeline may have а more impor- 
tant advantage. To the extent that it offers а 
purer measure of attitude, the pipeline should 
provide results more in accordance with real- 
ity (and the predictions of accurate theory) 
than ordinary rating scales. Use of the con- 
ventional rating scale in attitude research, 
then, could lead to spurious validation. 0i 
erroneous hypotheses. Circumstances muy 
exist where the biases of the rating scale lead 
to a functional relationship that is different 
in shape and direction than would be ob- 
tained with a less biased measure of attitude. 
If this attribute of the pipeline could be con- 
vincingly demonstrated, it would constitute а 
powerful argument in the pipeline's favor. 
Further, such a result would cast serious sus- 
picions on the validity of a great number of 
past empirical relationships that were basec 
on the rating scale. If it is found that тағ 
searchers have been misled in their hypothes' 
testing using conventional rating procedures, 
then it would become important to devise 
new measurement procedures, such as the 
bogus pipeline, that promise a truer view i 
which independent variables control attitude. 

No support for this potential asset was 
provided in the three already described stud- 
ies reviewed by Jones and Sigal] (1971). 
Findings that were significant for one measure 
showed the same pattern of condition nieans 
for the other measure, There was no sis?! a 
cant tendency for one to show a different 
functional relationship from the other. d 

A study by Sigall and Page (1971) oed 
the prediction that social desirability in 
ences on the rating scale would be HU. 
when the pipeline was used. White A 
subjects rated one of two ethnic groups "^ 
groes or Americans, on the degree to hoe 
each of 22 traits characterized the Lkw 
Half of the subjects responded using the ae 
line paradigm and half, although using 


і f аю. бей 
same steering wheel rating device, Were э 


ordinary rating scale instructions. ding 

Upon frst appraisal, some of the f tion 
of this study appeared to support the ie pe 
that different functional relations WO ed- 
obtained, depending on the measure was 
For example, one of the traits, honest, pan 
judged more characteristic of Negroes “Jess 


f м > И it 
Americans in the rating condition m 
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the c ama » Judged, 11 were identified by 
Portive of the p Fielding an interaction sup- 
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by the ا‎ ability bias not shared 
fied the Ene Fhese results seemingly veri- 
aws in the s the pipeline to eliminate 
ating scale. 
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study ж examination, however, this 
Superiority cp iic er ig substantiate the 
Actions, serio the pipeline. Of these 11 inter- 
only 6, as a consideration can be given to 
Cred fully remaining 5 cannot be consid- 
oie oii fabae ea ie). Ee 
specially ignificance level should have been 
univariate conservative in this study as 22 
ducted on € of variance were con- 

Not all nr observations. 
ere produce г he remaining six interactions 
direction of d by an actual reversal in the 
e ls. functional relationship be- 
Шр di. and judgment. If a trait is 
ethnic 
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эшенэ used, it 15 wrong to conclude the mea- 
sures lead to different functional relati i 
NC ANA nal relationships. 
| sui nt procedures showed 
aie ee relationships for 2 of the traits 
aving int 
germ ا ا‎ rd ^ ci showed 
A second Basis ES E d па 
Page results should b n sf. m E 
ge: SUIS: e evaluated is the com- 
parability of their findings with the results of 
other research on stereotyping. Sigall and 
Page borrowed their materials from a study 
conducted 2 years earlier by Karlins, Coffman 
and Walters (1969). Karlins et sf екаш 
the percentage of subjects who checked each 
trait as being characteristic of each ethnic 
group. The preferred comparison would be 
between the mean response differences. (be- 
tween Americans and Negroes) obtained by 
Sigall and Page and the percentage differ- 
ences for Karlins et al. 
Unfortunately, precise comparisons between 
the two studies are difficult because the Kar- 
lins et al. paper did not report the percent- 
ages of all traits for both ethnic groups.” They 
only provided data for the adjectives that 
received the highest percentage of “character- 
istic” judgments for each ethnic group, and 
there was little overlap in the top traits listed 
Americans and Negroes. Although the ac- 


for 

tual percentage differences cannot be com- 
puted, the minimum percentage differences 
can be derived for 13 of the 22 traits. Nine of 
these 13 showed a significant ethnic group 
main effect for Sigall and Page along with 
either insignificant interactions OF interac- 
| ersal of the 


id not involve à rev 
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et al. data was in com- 
measurement 
Page, the di- 
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plete accord. S0, when both 
procedures agreed for Sigall and 
rection of jonship exac 
with that reported by Karlins et al. | 

Although Sigall and Page found no signifi- 
cant ethnic group difference or interaction for 
f the remaining four comparison traits, 
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means are in better agreement with the Kar- 
lins et al. results than were the rating means 
‘for both these traits (see Table 2). 

Of the remaining two traits for which com- 
parable data were available, one showed a 
small minimum percentage difference (2%) 
and the other a somewhat larger difference 
(11%). Sigall and Page found a significant 
interaction for each in which the ethnic 
group differences were in opposite directions 
for the two dependent measures (see Table 
3). For the trait with the smallest minimum 
difference, sensitive, the direction of its dif- 
ference coincided with the direction of the 

rating scale means in the Sigall and Page 
study. Alternatively, the trait with the larger 
percentage difference, lazy, was in the same 
direction as the pipeline means. It ap- 
pears from these cross-study comparisons that 
when discrepancies exist between the ethnic 
group findings of the two studies, the Karlins 
et al. data are more likely to conform to Sigall 
and Page's pipeline condition than their rat- 
ing scale condition. 

An index of stereotype favorability was 
computed by Sigall and Page which showed 
that the stereotype held of Americans was 
more positive than it was of Negroes. Al- 
though this difference was observed for both 
the rating and pipeline conditions, it was sig- 
nificantly more pronounced with the pipeline 
measure. In fact, compared to the pipeline 
d'fference of .81 scale units, the rating differ- 
ence of .04 units was negligible. Karlins et 

al. also calculated an index of stereotype 
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ositive difference indicates the trait was judge 
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ences could not be calculated. 


favorability and they, too, found that the 
stereotype held of Americans was more post- 
tive than was the stereotype of Negroes. 50 
once again a comparison across the two stud- 
ies shows Karlins et al. data to be more simi- 
lar to Sigall and Page’s pipeline results than 
to their rating differences. 

This all suggests a rather startling олы 
sion. Sigall and Page's pipeline condition dic — 
not eliminate bias that existed in previous 
findings, but rather their rating condition 1- 
troduced a bias that was not present in the 
Karlins et al. study. Rather than demon- 
strating the superiority of the pipeline, they 
instead have shown that it is possible to p 
sent the self-rating task in a way that тах 
mizes biased responding. However, when ^ 
propriate precautions are taken with аи 
rating procedures, as was done in the Ka! ait 
et al. study, the rating scale and pipeline ? 
comparable. 


р А у i here 
Usually ratings are made in a setting W 


ind 
no one else observes the judgments Heng 
recorded. Tf the rater is in a large grou ls 
respondents, he may well assume ш à 
responses are not going to be singled yen is 
special attention. Often times the A 
not even required to place his name Г tions 
rating materials, Finally, except 1n sm ied 
where experimenter bias is being ardize 
stimulus presentation is usually stanc po 
through using slides, tape recordings, 
printed page. | 1 Page 
The rating condition of the Sigall ан оп. 
study incorporated none of these ewe in 
Subjects made their ratings individual» 
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the presence of the experimenter and were 
required to announce each aloud before pro- 
ceeding to the next trait. In addition, the 
experimenter orally presented the 22 traits to 
each subject, thereby leaving open the pos- 
sibility of experimenter bias through voice in- 
flection and emphasis. 

It must be concluded that the evidence 
Teviewed by Jones and Sigall (1971) does not 
E = notion that the pipeline leads to 
tion ^e unctions differing in shape or direc- 
EA om those obtained with more mundane 

E procedures. 
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individual. To the extent that a person's 
emotional stability is contingent on such be- 
liefs and feelings being suppressed from con- 
sciousness, the pipeline, by forcing these re- 
sponses into consciousness, could disrupt such 
adaptive patterns, 

But for even the mentally healthy subject, 
the pipeline represents an ethical hazard. It 
involves an unusual form of double deception. 
Most deceptions are characterized by the sub- 
ject merely “going along” with the experi- 
mental manipulations. He does not actively 
disbelieve what he is told, for the scenario is 
designed to fit in with the subject’s prior con- 
ceptions of the world. On the other hand, 
the pipeline presents the subject with an unbe- 
lievable situation, one that goes against his 
common sense. Yet by ingenious duplicity, 
the experimenter is able to overcome the sub- 
ject’s initial skepticism. 

Then comes the debriefing. With many de- 
ceptions, a subject's self-es eem is little af- 
fected since the situation seemed reasonable 
and plausible from the very beginning. Any- 
one would have taken it the way he did. But 
it is conceivable that the subject’s reaction to 
discovering the deception of the pipeline is 
quite different. In accepting the deception, he 
must acknowledge that his own ability to 
defend his prior notions of reality was lacking 
and that the experimenter was able to put one 
over on him. The subject might prefer to be- 
lieve at this point that the machine really 
could measure his attitudes rather than his 
having to yield once again to a new set of 
arguments presented him by the experimenter. 
To go along with those arguments would mean 
that he had twice been unable to defend con- 
victions that he honestly and firmly held. 
Jones and Sigall (1971) reported that sub- 
jects were “quite surprised” to learn that the 
pipeline was bogus. Subjects may leave the 
experiment not knowing which of the two 
stories to believe. 

Whether this deception actually does affect 
subjects’ self-esteem or faith in human nature 
(or faith in the psychologists’ nature) should 
be empirically examined. Indeed, it may well 
be possible to develop debriefing procedures 
that not only nullify negative reactions to the 
pipeline, but convert them to positive growth 


experiences. 
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Consider next, the matter of the pipeline's 
practicality. Unlike the rating scale, the pipe- 
line presents difficulties in constructing the 
apparatus, transporting the apparatus, run- 
ning more than one subject at a time, and 
requiring excessive time for instructing each 
subject on how to use the apparatus and how 
to believe in it. But these are tolerable lia- 
bilities; experimenters should be willing to 
put in the extra effort if the pipeline's su- 
periority can be established. 

There is one practical problem that imposes 
a potentially serious limitation on the pipe- 
line. If the pipeline becomes routinely 
adopted, the nature of its deception would 
probably become widely known thereby mak- 
ing it difficult to obtain truly naive subjects. 
This would be especially true for the under- 
graduate culture, a relatively closed svstem 
that can be quite efficient in transmitting 
information of this kind to future generations. 
This requirement of subject naiveté, vital to 
the pipeline's success, is not shared by ordi- 
nary rating procedures. 

This feature of the pipeline presents a net- 
tling problem to its advocates. If it is true 
that widespread adoption of the paradigm, 
such as is facilitated by its publication in the 
Psychological Bulletin, will lead to awareness 
of its deceptive nature by the undergraduate 
subject pool, then the pipeline will become 
unusable for such subjects. And the more 
often it is used, the more rapidly this point 
of invalidity will be reached. There is no way 
of concealing its deceptive features short of 
withholding this information during debrief- 
ing. Even then, knowledge of the technique 
may be disseminated in other ways. Articles 
in the Vew Vork Times Magazine, Psychol- 
ogy Today, or the behavior section of Time 
magazine could hobble the technique for years 
to come, 


CONCLUSIONS 
Whenever a new measurement procedure 
is proposed, it is necessary to examine fully its 
strengths and weaknesses. The task of test 
evaluation is made somewhat easier when a 
widely used technique, such as the rating 


scale, exists as а standard of comparison, In- 
stead of having to make an absolute judgment 
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of good or bad, one can make the comparative 
decision of better or worse. 

Four bases of comparison have been out- 
lined: relative sensitivity, functional compara- 
bility, ethical character, and practicality. 
Evaluating the pipeline's merit in comparison 
with conventional rating formats on these 
grounds leads to the conclusion that the pipe- 
line's superiority has not yet been proven. 

Тһе data reviewed by Jones and Sigall are 
encouraging, but not conclusive regarding the 
relative sensitivity of the pipeline. There was 
some suggestive evidence that it may contain 
more true variance than the rating scale and 
that its scale origin might be more veridically 
located. 

Whereas the pipeline may prove to be su- 
perior on sensitivity grounds, it is decidedly 
less preferable than the rating scale on both 
ethical and practical grounds. If the extra 
sensitivity is not needed for testing the hy- 
potheses under consideration, it would appear 
preferable to continue using a traditional rat- 
ing format. 

Clearly the most important potential con- 
tribution of the pipeline lay in demonstrating 
that it leads to different functional relation- 
ships than the rating scale. If the pipeline 
could provide tests of hypotheses unobtain- 
able with other measures, then researchers 
should be more willing to cope with its ethical 
and practical drawbacks. 

It would seem, then, that establishing func 
tional noncomparability would be of primary 
concern, especially in view of the possibility 
that the pipeline has a limited half-life due E. 
people becoming aware of its deceptive vwd 
If such noncomparability can be establishe¢, 
then additional research is warranted to de- 
termine (a) whether the pipeline contains 


i : he 
different or fewer response biases than E 
rating scale and/or (b) whether it пейш! 

p.18 = 


a different property of attitude thar 
sessed by the rating scale. 

Tf the pipeline is shown to be different oe 
the rating scale, an alternative to the pipe е 
should be sought that, while possessing y 
pipeline’s essential properties, would not the 
the ethical and practical deficiencies оше 
pipeline inherent in it. If the primary es se 
ences were found to be in regard to T 
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Courage honesty in a way comparable to the 
pipeline could be tested. On the other hand 
if the differences between the two MENES 
were due to their tapping different facets of 
attitude, then instructional sets could be pre- 
pared that sensitize the subject to the specific 
aspect of attitude that the investigator wished 
to elicit, Successful attempts at such differ- 
ential operationalization of affective, conative, 
t cognitive components of attitude have 
ееп achieved by Kothandapani (1971) and 
Ostrom (1969) when employing rating tech- 
niques, " 
nne and Sigall have developed a very 
Tie and potentially informative meth- 
empl iem device. One approach they have 
is ч for establishing its value, conduct- 
the Pm experiments that differ only in 
Serves ependent measures being compared, 
ing | a model research design for examin- 
tain] pendent variable differences. It is cer- 
у superior to most correlational ap- 
Tee that have been proposed (e.g., Gul- 
Bees | 968). Few other critics of self-rating 
such in attitude measurement have initiated 
phi g careful approach to examine the em- 
E inq of their criticisms. One 
interest 7 Jones and Sigall, as well as other 
йорту. investigators will jeontimue this 
ences z of research on the pipeline's differ- 
eed om the rating scale. In such research, 
Should ен ам formats of Ше rating scale 
- be used rather than the version adopted 
Жүн and Page (1971). The pipeline 
ls nen EE with the best, gene eso 
Ormats. › of the conventional rating SC 
Problen n the ws 
ogy m e outlined above, social psy t 
understa yan substantial benefit in € 
Glan nding the components of attitude 
is а response biases in rating measures. 
Ppears to be a more profitable use of 
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the pipeline’s possibly limited half-life than 
employing it in situations (such as testing 
dissonance predictions) where an ordinary 
rating scale should be adequate. One impor- 
tant feature of such research is that it culmi- 
nates in the obsolescence of the very measure 
it seeks to validate. This temporary and re- 
stricted use of a research paradigm could be 
profitably applied to other questionable 
methodologies. 
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The “bogus pipeline” is defended, not as an all-purpose substitute for conven- 
tional rating scales, but as a procedure useful in certain settings to inhibit 
social desirability influences or to explore the affective components of attitudes. 


Previous studies do show reliable differences 
and conventional ratings, though the interp E 
equivocal. Issues of ethics and practicality are discussed with the conclusion 


between bogus pipeline estima 
tion of these differences rer 


that difficulties with the bogus pipeline are far from unique to this procedure. 


We confess a decided ambivalence toward 
оъ (1973) critique of our *bogus pipe- 
line paper. We do not know whether to be 
pleased with the conclusion that “Jones and 
Sigall have developed a very interesting and 
potentially informative methodological de- 
vice [p. 259]" or to be dismayed by being 
referred to as “rotting organic matter" gen- 
erating “swamp gas.” 

The reader who has not seen the original 
paper might be persuaded by Ostrom that we 
have urged the abandonment of conventional 
rating scales in favor of bogus pipeline pro- 
cedures, Thus, we note Ostrom's reference to 
the issue of adopting the pipeline “аз a stan- 
dard format for the self-rating of attitudes 
[p. 253]." And, again, there is a reference to 
whether or not the “pipeline should routinely 
be used in attitude research as a replacement 
for the rating scale [p. 257].” The straw man 
introduced by such remarks is inconsistent 
with the thrust of our paper. At no point did 
we advocate, or even consider, the routine 
substitution of the pipeline for conventional 
rating scales. In one of our concluding re- 
marks (Jones & Sigall, 1971), we emphasized 
that “it is important not to embrace too hast- 
ily the proposition that the bogus pipeline is 
necessarily a better or truer measure of atti- 
tude than the standard devices [p. 363].” It 
was certainly our intention to be extremely 
cautious in claiming advantages for the bogus 
pipeline, and our effort was to indicate by 
example some of the particular circumstances 
where it might produce different results. We 
completely agree with Ostrom that the dif- 


+ Requests for reprints should be sent to Edward 
E. Jones, Department of Psychology, Duke Univer- 
sity, Durham, North Carolina. 


ferences between rating scale and pipeline rej 
sults cannot at this stage be rigorously 1M- 
terpreted. 

We do not agree, however, that no impor- 
tant differences have been demonstrated. 
Ostrom draws a sharp distinction between 
“sensitivity” and “functional comparability- 
We believe that such a distinction is very 
difficult to maintain, or at least that its 1m- 
portance has been exaggerated. Significant 
crossover interactions are, of course, dramatic. 
In our suggestion that the pipeline, in some 
conditions, will convey a more accurate 019) 
ture of attitudes, we make no claim that pipe 
line results always or often reverse rating 
scale results. A number of studies show S18 
nificant effects with the pipeline measure a?“ 
not with conventional rating scales (82 
Cooper, 1971). To argue that such diferet a 
can be overcome by simply increasing e 
number of subjects in the rating condition 
fallacious. If the rating results of two expe i 
mental conditions are practically uper 
is hard to see how the addition of "P. 
subjects will automatically guarantee а ee 
nificant difference, Also, a significant г 
over interaction was obtained in а study gus 
Jones and Wein (1972) that compared pos a 
pipeline estimates with ratings in judge ad 
person who expressed increasing ог de y 
opinion similarity to the subject. In gen 
we do not see why crossover interac 
any more necessary for testing rela main 
hypotheses than other interactions and 
effects. праг“ 

Ostrom spends considerable space po а 
ing the Sigall and Page (1971) study WI vi. 
earlier study by Karlins, Coffman, a" js that 
ters (1969), His main argument here ! 
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the latter results fall in the same pattern as 
the bogus pipeline results in Sigall and Page 
Whereas the conventional rating results do 
not. Although cross-study comparison ignores 
the “vast difference in approach and pro- 
cedure [Sigall & Page, 1971, p. 25]," we 
agree that the rating scale results may have 
been contaminated by social desirability 
Standards, Indeed, Sigall and Page (1971) 
explicitly acknowledge this possibility and 
emphasize that “the evidence we have gath- 
ered, which suggested that social desirability 
p affected our rating scale subjects, does 
ч In апу way prove that such demand char- 
Cteristics were operative in any of the 
faceton studies [p. 253].” What the study 
e Show is that the bogus pipeline inhibits 
саадаг of social desirability norms and 
for о у е scores that are higher 
Americans and lower for Negroes. 


Ethic ee 
hics and Practicality Issues 


b Certainly agree that the use of the bogus 
e raises complex ethical issues and en- 
should Ostrom’s conclusion that the pipeline 
rating not routinely replace the conventional 
ext ES We also accept the general con- 
research к which Ostrom chooses to discuss 
Ul scien ethics, namely, that the gain in use- 
Shoulq titie information about human nature 
volved E the potential dangers in- 
or the individual subject.” 

a | We see it, the basic danger involved for 
Ndividual subject is the distress concelv- 

i A Caused by an invasion of his privacy. 
nij. 1пуазїопв are also implicit in the ad- 
boni of projective tests and other 
feyde probes whose interpretation js not 
сасу ent. If our assumptions about the 
Jects a the bogus pipeline are correct, sub- 
attitudi be induced to reveal attitudes or 
f ^ nal components that they would pre- 
With aa Uh Certainly in most cases, and 
istres attitudes, this is less likely to cause 
i an many experimental situations 1n 
€ subject is induced to prevaricate, to 


Che. 

at Ы 

The + OF to bring harm to another subject. 
e bo: © 


~~ S'S pipeline subject is "of forced to 


xm 
on at relativistic context is a guiding assump- 
nv olo he latest draft prepared by the American 
“сар Bical Association's Ad Hoc Committee on 


tandards for Psychological Research. 
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acknowledge to others or to himself that he 
would respond differently to a conventional 
rating scale. Even if the subject becomes 
aware of this possibility, Ostrom himself ad- 
mits that a sensitive debriefing can result in 
a “positive growth experience.” In our own 
work we have run across very few subjects 
who were even mildly upset by the bogus 
pipeline procedure. 

Ostrom argues that the bogus pipeline in- 
volves an unusual form of “double decep- 
tion,” but nothing he subsequently says justi- 
fies the use of such a label. Somehow, unlike 
other deception experiments, the bogus pipe- 
line allegedly convinces people against their 
will. Distress is caused by the subject’s reali- 
zation that he has been “taken in” by the 
experimenter. We fail to see how a bogus pipe- 
line experiment involves more, or a different 
kind of, deception than any other experiments 
attempting to convince subjects that what is 
false is really true. We are particularly sur- 
prised by the suggestion that subjects might 
prefer to believe that the machine really 
works and will not, therefore, accept the de- 
briefing. On the contrary, subjects can be 
easily led to understand the deception and 
the reasons for it; they are normally relieved 
to learn that such a measuring device is still 
beyond the grasp of psychological science. 
Still, the ethical issues associated with use of 
the bogus pipeline cannot be summarily dis- 
missed. Each investigator will have to ask 
himself (and hopefully his colleagues) whether 
the probable knowledge to be gained is worth 
the potential of distress to subjects. 

The practicality issue is only an issue if 
one assumes that every social psychologist 
will rush to build his own pipeline and that 
bogus pipeline studies will sweep the cam- 
puses of America. While such a fantasy flat- 
ters the technique, we have urged that the 
bogus pipeline should only be used when im- 
portant theoretical issues are involved, when 
subjects are otherwise likely to resist disclo- 
sure of their attitudes, or when the experi- 
menter is interested in tapping different atti- 
tudinal components. Overconcern with the 
practicality issue would long ago have ruled 
out conformity studies, self-esteem manipu- 
lations, obedience research, in short, any ex- 
perimental program that involves fairly stan- 


dardized deception techniques. It is hard to 
argue that our knowledge of human nature 
has not been advanced by such deception re- 
search. 

In conclusion, we agree with Ostrom that 
the meaning of bogus pipeline results must be 
clarified by further research. We disagree that 
no important differences between rating scales 
and pipeline estimates have been revealed to 
this point. We do not see inherent in the 
bogus pipeline paradigm any ethical or prac- 
tical difficulties that would single out the para- 
digm and distinguish it in this respect from 
other deception research. And finally, we be- 
lieve that Ostrom's message is a confusing one. 
On the one hand, he argues that nothing has 
turned up to justify a claim for the superior- 
ity of the bogus pipeline so that the ethical 
prohibition should stand. On the other hand, 
he exhorts us (and others) to continue to 
probe through the use of further research 
for any differences that might exist. It seems 
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to us that there is an inherent contradiction 
in these exhortations. 
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arce mental health services and previous research with traditional 


therapies, innovative treatment approaches are employed increasingly with, 


low-income patients. Pertinent data 
present trend produce separate 


must be carefully considered lest the 


and unequal services for various social groups. 


While socioeconomic status appears to be a significant. correlate of acceptance 


for, and duration of, individual psych 


otherapy, it does not relate to treatment 


outcome. Clarification of this discrepancy requires a more precise identification 


of the social groups comprising the “1 
the environmental demands faced by 
ment of specific socioeconomic status 
viven patient from a given 


ower-classes"; a clearer understanding of 
such patients; and an objective assess- 
factors that may, or may not, exclude a 


treatment. Resolution of such issues is necessary 


for a realistic allocation of available mental health services. 


Бе warning sounded by Hollingshead and 
а (1958) more than а decade ago re- 
ains equally pertinent today: 


dii, erestion that different social cla: receive 

a iy treatment for mental illness ma come as 

tasteful but to repress facts because they are dis- 
ead t and incongruent with cherished values may 

we s Consequences even more serious than those 

re Are trying to escape by substituting fantasy for 
ality Гр. 4] 


he the critical discrepancy between the 
Cows nd for mental health services (Zax & 
availali 1972) and their current or projected 
creas ility (Albee, 1969), it is becoming in- 
Моны important to identify patient popu- 
tions S for whom given therapeutic interven- 
èr are effective. Accordingly, Strupp and 
Y (1969) identify as the principal em- 
apy: « question for future research in ther- 
Produ What specific therapeutic interventions 
дар ч Specific changes in specific patients 
is еше conditions [p. 20]?” While 
€ re an extremely complex question, it must 
ealistic lved if services are to be allocated 

ically, 
ji; Ph optimal mental health services for 
ing ae individuals, research findings deal- 
ith their experiences in applying for out- 
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patient treatment at mental health facilities 
must be critically assessed. Notwithstanding 
the availability of relevant data, programs 
continue to be designed and operated, based 
on strongly held views without empirical 
grounding. Given the recent proliferation of 
innovative treatment approaches for the poor 
(e.g., Guerney, 1969; Minuchin, Montavo, 
Guerney, Rosman, & Schumer, 1967; Small, 
1971: Sobey, 1970), therapists’ treatment op- 
tions have increased dramatically. Economic 
factors decrease the likelihood that such pa- 
tients will be assigned to classical therapy 
approaches. Lerner (1972) raises several im- 
portant issues relative to such decisions and 
emphasizes the need for careful considera- 
tions of therapeutic effectiveness lest two 
separate yet unequal treatment systems evolve 
—one for the poor and another for the well- 
to-do. In this context, the present paper re- 
views findings dealing with low-income pa- 
tients applying for mental health services at 
clinics emphasizing traditional treatment 
approaches. 
Crass IV VERSUS CLASS V 

Defining à group's characteristics is an 
essential precondition to considering its ex- 
periences! The most widely used instrument 
for determining socioeconomic status In psy- 
chotherapy research has been the Hollings- 
head Index of Social Position, developed for 
use in the New Haven community study 
(Hollingshead & Redlich, 1958). Tnitially, the 
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TABLE 1 
OCCUPATIONAL, EDUCATIONAL, AND FAMILIAL CHARACTERISTICS OF FIVE 
SOCIOECONOMIC STATUS LEVELS 
T 1 = 
Class Occupational level | Educational level 


Family structure 


Salaried positions in policy- 
I | making executive level; | 
private-practice professionals | 


| Professional degrees; A.B. level 
and beyond 


Modal nuclear family of parents 
and children, with stability en- 
couraged 


Salaried positions in business and | A.B. level or partial college Modal nuclear family of parents 
II | professions; minor professionals and children with stability en- 
0) included couraged 
“Middle-class’’ administrative, High school diploma Modal nuclear family of parents 
ш clerical, sales, technical, and and children with stability €n- 
semiprofessional positions couraged 
IV “Working-class”’ skilled and High school or technical | Modal nuclear family often three 
semiskilled manual occupa- school diploma with some generations, instability more 
tions in unionized trades and below tenth grade common than I-III 
industries | | 
| | 
"Poor" semiskilled and unskilled | High school diploma in- Modal nuclear family extended to 
V manual occupations non- | frequent with many not | 3-4 generations; divorce, 
unionized with irregular em- | completing eighth grade separation, and instability com- 
ployment mon 
Note, The socioeconomic status levels are referenced in Hollingshead and Redlich, 1958, 


Index defined class status in terms of three 
variables: area of residence, occupation, and 
educational background. Later, it was revised 
to exclude residence (Hollingshead, 1957) 
and a correlation of .97 between these two 
forms has been reported (Myers & Bean, 
1968). The Index identifies five socioeconomic 
status levels, the highest being Class I; the 
lowest, Class V. Table 1 summarizes the 
major characteristics of the classes. Although 
the Index was developed specifically for the 
New Haven community in the 1950s, it is 
assumed to have some continuing generality. 
Future revision of this and related indices, 
however, should reflect the profound educa- 
tional, economic, and social changes of the 
last two decades—changes felt mostly by low- 
income groups. Recent studies of low-socio- 
economic-status groups continue to rely on the 
Index as the critical socioeconomic status 
measure, without correction for ongoing so- 
cial change. Findings to be reviewed must be 
evaluated with this limitation in mind, 

The two low-income groups (Classes IV 
and V) have generally been perceived as simi- 
lar on the basis of certain commonly shared 


characteristics. Both are typically involved in 
manual occupations, are removed from the 
middle- and upper-class "prestige race," are 
"down-to-earth," are pragmatically oriented 
(Miller & Riessman, 1969), tend to live in 
informal, comfortable relationships with oth- 
ers, and appreciate the affectionate bite of 
humor (Gans, 1962). These communalities 
notwithstanding, it is misleading to ignore 
significant differences between these tw? 
groups and thus to consider them as a homo- 
geneous segment of the population. 

Miller and Riessman (1969) suggest that 
the unskilled, irregular, Class V worker lacks 
the disciplined, structured, working-class aP- 
proach. He is less able to cope with life’s de- 
mands because of its seeming hopelessness: 
Although higher socioeconomic status groups 
neither admire nor understand the Class ~- 
working man, they can appreciate the gam 
ucts of his skill and efforts. The poor, ho 
ever, are objects of the prejudice and lac lass 
concern of all other classes, including C 
IV. Class V members are far less likely 
attain middle-class goals and possessions 


: ions. 
are working-class members. Labor union 
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prime means of organization and political 
power for the working class, are often inac- 
cessible to the Class V worker. The combina- 
tion of economic insecurity and familial in- 
Stability limits the poor to a crisis reactive ex- 
151епсе (Miller, 1964) not generally experi- 
enced by Class IV members. Class V exis- 
tence is further characterized by a lack of 
Control over the forces governing an indi- 
Vidual’s existence (Alinsky, 1967; Hagg- 
Strom, 1964), J 
Differentiations among Class V members 
are themselves extensive (Riessman, Cohen, 
arse 1964). The poor include unskilled 
ie ee employed whites and non- 
un S, migrant farm workers, numerous mi- 
Lus Broup members, ghetto dwellers, and 
lee y щн. Sarbin (1970) notes that 
the a consists of both the respectable and 
Poen €staded poor, the latter being continual 
m Cpants in а culture of poverty. Inclu- 
inherit Such a culture identifies one as “dis- 
the _and thereby as “not included in 
ety "s lectivity that makes up the ‘real’ soci- 
Whit teal’ people [Rainwater, 1970, p. 9]." 
and : e the preceding account is descriptive 
Eu ош, it suggests the need to avoid 
өш, references to the “lower classes.” 
differing and V include different groups with 
ignored ; life styles—a distinction typically 
fishes the findings reviewed. | 
(197) 1180 (1969) and Thomas and Sillen 
are strongly critical of the relative 
„Of empirical data about the feelings, 
i о, and behavior of the poor, both 
and nonwhite. A similar lack is also 
and nd Working class. While Gans (1962) 
insig ie oe et al. (1967) provide some 
дого. O the life-styles of the poor, more 
' integrated evaluations of едо 
^ problem-solving capacities, and 
Organizations is needed for these 
del (1970) identifies needed areas 
buti with the poor and emphasizes the 
ese On that such studies would make. 
Such ЁШ, there is little scientific data about 
Whi alters, 
Mache Many innovative treatment ap- 
0; е, ahistorical (Small, 1971; anny, 
` of "bles Mamoto & Goin, 1965), traditiona 
the tend more to consider knowledge 
Cultura] background of patients in 


absen ce 
тофу, 
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trying to understand the evolution of a pa- 
tient's disorder (Savitz, 1952). Lerner (1972) 
questions whether in the absence of such 
knowledge it is possible for the therapi 
empathize optimally with a patient and hence 
establish therapeutic relationships comparable 
to those observed with middle- and upper- 
class patients, This caution must be seriously 
considered in an assessment of previous re- 
search findings with low-income groups. 
Furthermore, therapeutic contracts and goals 
must recognize and operate within the real- 
istic limits of the patient’s environment 
(Chess, Clark, & Thomas, 1953; Green, 
1946). To ignore the social reality of a pa- 
tient’s existence while attempting to imple- 
ment change in therapy may be self-defeat- 
ing. Failure may often follow when the en- 
vironment into which a patient must return 
will not accept such change (Thomas & Sil- " 
len, 1972); at such time it may be a sign of 
health for a patient to refuse change and to 
withdraw from treatment. 

Schneiderman (1965) suggests that it might 
be helpful to the low-income patient if the 
professional assesses his adaptive and prob- 
lem-solving capacities in relation to the situ- 
ations in which they occur. Without such 
knowledge, the professional may be subject to 
ethnocentric prejudgments about the patient's 
behavior. Healthy middle-class solutions may 
not be healthy for the problems of low-income 
people. Cohen (1964) and Fishman (1969) 
have both suggested that some forms of be- 
havior, deviant by middle-class standards, are 
the only available mastery outlets for low- 
income individuals and must be used to avoid 
loss of self-esteem or a sense of degradation. 
Grier and Cobbs (1968) perceive few other 
alternatives as typically available to ghetto 
dwellers. For the poor, aggression and acting 
against authority figures are important 
coping devices in dealing with certain ego 
threats (Grey, 1969). Such behaviors may be 
necessary substitutes for the low-income per- 
son’s impulses to cry, plead, or talk things 
over—each socially unacceptable (Fishman 
& McCormack, 1969; Silberman, 1964). 
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out 


$ОСТОЕС 
Although Classes IV and V have been de- 
scribed as the most psychiatrically “impaired” 
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segment of society (Srole, Langer, Michael, 
Opler, & Rennie, 1962), they have the least 
chance of obtaining professional help in out- 
pati facilities (Hollingshead & Redlich, 
1958; Srole et al., 1962). Kadushin (1969) 
reports that less than 5% of all patients seen 
at Manhattan Mental Health Clinics were 
members of minority groups—a proportion 
far below general population rates for that 
area. In the studies reviewed below, socioeco- 
nomic status correlates significantly and nega- 
tively with acceptance for and duration of 
individual psychotherapy, with experience 
level of assigned therapist, but not with a 
patient's diagnostic category or source of re- 
ferral. These findings take on greater signifi- 
cance in that the data were drawn from rec- 
ords of clinics in which ability to pay was not 
a condition for treatment. 

A number of reports from the New Haven 
studies indicate that low socioeconomic status 
is negatively related to acceptance for indi- 
vidual psychotherapy (Hollingshead & Red- 
lich, 1958; Redlich, Hollingshead, Roberts, 
Robinson, Friedman, & Myers, 1953; Robin- 
son, Redlich, & Myers, 1954). Similarly, the 
case records reviewed by Schaffer and Myers 
(1954), Myers and Auld (1955), Redlich 
(1958), and Myers and Schaffer (1967) con- 
firm the importance of socioeconomic status 
in treatment decisions. At the New York 
Veterans Administration Regional Office, be- 
ing assigned to psychotherapy, rather than to 
a somatic treatment, was significantly corre- 
lated with a patient's socioeconomic status 
level (Bailey, Warshaw, & Eichler, 1960). 
Therapist’s ratings of patient motivation and 
acceptability for treatment related signifi- 
cantly and negatively to socioeconomic status 
for the 22 Veterans Administration clinics sur- 
veyed (Bookbinder & Gusman, 1964; Raskin, 
1961). Rosenthal and Frank (1958) found 
through an analysis of case records of 384 ap- 
plicants for outpatient treatment that psy- 
chiatrists refer to therapy persons most like 
themselves, that is, whites rather than non- 
whites and those in the upper rather than in 
the lower income range. Class V applicants 
were least likely to be accepted for psycho- 
therapy at the University of Utah Medical 
School Clinic (Codes, Branch, & Allison, 
1962). Shader (1970) noted that low-socio- 
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economic-status applicants to a walk-in clinic- 
were significantly more likely to receive drugs 
than individual psychotherapy. 

Attrition rates among low-income patients 
assigned to individual psychotherapy were 
significantly higher than for their middle- and 
upper-class controls. Myers and Roberts 
(1959) observed that low-income applicants 
accepted for therapy either quit in disgust or 
were dropped as failures. Low-socioeconomic- 
status patients seen in Veterans Administra- 
tion settings terminated prematurely signifi- 
cantly more often than did patients from 
other socioeconomic status levels (Dengrove 
& Kutash, 1950; Sullivan, Miller, & Smelser, 
1958; Winder & Hersho, 1955). 1 

At the Phipps Psychiatric Clinic, social 
class and attrition were significantly and 
negatively related (Imber, Nash, & Stone, 
1955; Frank, Gliedman, Imber, Nash, & 
Stone, 1957; Rosenthal & Frank, 1958; Im- 
ber, Frank, Gliedman, Nash, & Stone, 1956). 
Neither residents nor patients could be “pres- 
sured" by clinic administrators to maintain a 
relationship that was mutually unsatisfying: 
Nash, Hoehn-Saric, Battle, Stone, Imber, and 
Frank (1965) report that socioeconomic status 
related positively to therapist's ratings of pa- 
tient attractiveness, ease of establishing rap- 
port, and good prognosis, each of which 
related significantly to treatment duration. ^ 
review of 300 case records further confirms 
the negative relation between socioeconomic 
status and attrition rates (Katz, Lorr, 
Rubenstein, 1958; Lorr, Katz, & Rubenstein, 
1958). In addition, Schneiderman (1965): 
working in a Family Service Clinic, fou” 
that with declining socioeconomic status leê’ 
there was a greater liklihood that the pest 
worker, rather than the patient, would initia 
termination. E 

Social class has thus been shown to be | 
important determinant of acceptance for; e 
duration of, individual psychotherapy (5an 
1966). Low-socioeconomic-status applica 
are significantly less likely to be assign’ е 
individual treatment, and if assigned, are pe 
likely to terminate prematurely. Tu g 
tions have been observed across a vale T E 
settings in different parts of the ит А con- 
erally speaking, low-socioeconomic-status 
traindicates individual therapy. 
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The prepotency of socioeconomic status in 
treatment decisions is reflected in reports that 
assignment to therapy is more a function of 
Social class than of diagnosis (Myers & Auld, 
1955; Myers & Schaífer, 1967; Robinson, 
Redlich, & Myers, 1934). The eaten ud 
relation between treatment disposition and 
agnosis has been observed in the records of 
сеш Administration clinics (Bailey, 
Varshaw, & Eichler, 1960), of the Phipps 
a= (Imber et al., 1955, 1956), of the New 
е State Psychiatric Institute ( Budner, Es- 
of Cl m Malitz, 1964), and of the University 
ES pue d Medical School (Heine & Tros- 
lowes 260). Among diagnostically comparable 
Es tie applicants for ther- 
em * Ше Los Angeles General Hospital, un- 
E ruant (Yamamoto & Goin, 1965) or 
Jam tity group membership (Yamamoto, 
i es, Bloombaum, & Hatter, 1967) caused 
th em to be seen as even less appropriate for 

пегару, 
рае has been no systematic verification 
Eo affer and Myers’ (1954) finding that 
ind patients are disproportionately 
of m to inexperienced therapists. Studies 
Xu) s oepanomitstatus patients in indi- 
reside therapy have almost exclusively used 
terns ЕТ medical students, or psychology in- 
Baten therapists: senior staff members 
ients ntly did not treat Class IV or V pa- 
Suppo z these studies. This datum indirectly 
is à ds s the view that socioeconomic status 
While er minant of who will treat a patient. 
experi it has not yet been established that 
litte e therapists produce more positive 
972. 168 (Bergin & Garfield, 1971; Lerner, 
Cina! Менго & Kornreich, 1970) Sanua’s 
argument that lower income patients 
t appreciate inexperienced therapists, 
fe 5 consideration, "UN 
viduals tendency to see Class IV and V indi- 
b OWer., и part of a blurred, homogeneous. 
Questio, ass” obscures essential empirical 
Dent ns, It is valid to ask whether treat- 
Rroupg -comes are the same for these two 
twee, All too often comparative analysis 
causa Classes IV and V has been impossible 
With p, Amples studied were not even mo 
Sbplica А, distinction in mind, That Class ДУ 
арса S differed significantly from Class i 

nts in acceptance rates for therapy 


May no 
deserve 
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(Coles, Branch, & Allison, 1962) suggests 
that Class IV and V treatment experiences 
are not identical. Furthermore, unemploy- 
ment and minority group membership, char- 
acteristics of Class V, significantly differenti- 
ated low-income applicants accepted for ther- 
apy (Yamamoto & Goin, 1965; Yamamoto 
et al., 1967). Although far from conclusive, 
such data underscore the need to identify more 
systematically, specific socioeconomic-status- 
related factors involved in treatment decisions. 

Conclusions about entering or remaining in 
therapy drawn from the studies reported, do 
not establish a relation between socioeconomic 
status and treatment outcome (Truax & Cark- 
huff, 1967). Katz, Lorr, and Rubenstein 
(1958) did not find social class differences 
among patients designated as “successes” by 
their therapists; all socioeconomic status lev- 
els were represented approximately equally 
among successful patients, Similar observa- 
tions have been reported by Rosenthal and 
Frank (1958), Brill and Storrow (1960), and 
Coles, Branch, and Allison (1962). Albronda, 
Dean, and Starkweather (1964) provide sug- 
gestive evidence supporting Frank’s (1961) 
view that overcoming reality factors associ- 
ated with continuation in therapy for low- 
socioeconomic-status patients may reflect an 
extremely strong motivation for change. After 
the eleventh interview, low-income patients 
responded slightly better to treatment than 
did higher socioeconomic status controls. The 
potential for positive outcome in traditional 
treatment approaches with low-income pa- 
tients has been documented in an important 
study reported by Lerner (1972). In addi- 
tion, the economic feasibility of providing 
such treatment was also supported by Ler- 
ner’s (1972) findings. 

The studies reviewed also ignore the fate of 
those rejected for, or rejecting, treatment. 
Brandt (1964) and Riess and Brandt (1965) 
provide data suggesting that significant num- 
bers of “rejectees” enter treatment else- 
where. Does the low-income “rejectee” apply 
and meet a similar fate at another mental 
health facility? Is he accepted for treatment 
elsewhere? Conceivably, the initially rejected 
low-income applicant may become part of a 
pool of repeatedly rejected applicants, ac- 
cepted low-socioeconomic-status applicants, or 
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successful therapy cases. Data bearing on such 
questions do not exist! 

The observed discrepancy between accep- 
tance and outcome data for low-income ther- 
apy patients has generated a number of ex- 
planations—few empirically tested. Truax and 
Carkhuff (1967) suggest that the acceptance 
data typically reflect prejudices common to 
the field. Gundlach and Geller (1958), Mil- 
ler and Mishler (1959), and Koegler and 
Brill (1967) conclude that selection factors 
related to socioeconomic status more often 
meet therapist needs than treatment require- 
ments. Fierman (1965) dismisses all socio- 
economic status limitations to treatment as 
myths wi no basis whatsoever in reality. 

his view, the very factors used to rational- 
ize exclusion of low-income individuals from 
treatment justify their need for therapy. 

Tn a satirical essay, Adams and McDonald 
(1968) point up an analogy between the recep- 
tion of low-income applicants at clinics and 
the “cooling-out” practices that con men use 
on their victims, They suggest that the “real 
purpose” of the intake interview is to amass 
evidence that the poor are neither treatable 
nor motivated. Similarly, McMahon (1964) 
argues that the fact that therapists “permit” 
so many of the poor to refuse treatment, 
without allowing them to understand what 
they are rejecting, is tantamount to a “de 
facto practice of disengagement from the 
poor [p. 287].” Chessick (1971) states that 
treatment decisions based merely on a pa- 
tient’s economic qualifications violate a basic 
human right “to be able to receive the best 
possible medical care regardless of socioeco- 
nomic status [p. 37].” 

Although the current social climate favors 
the practice of indicting policymakers for ap- 
parent prejudicial distribution of services 
(Thomas & Sillen, 1972), the soundness of 
explaining the findings on this basis alone can 
be challenged. In fact, the data reviewed indi- 
cate that Class IV and V applicants simply 
do not receive, or continue in, individual psy- 
chotherapy. The defensibility of the view that 
rejected or prematurely terminating patients 
would have profited from therapy as much as 
“successful” low-socioeconomic-status patients 
do, is questionable. Furthermore, there is no 
evidence indicating that terminators did not, 
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in fact, profit to some degree. There are far 
too many individual difference variables to 
justify such conclusions without solid objec- 
tive data. Clearly, it is inequitable for socio- 
economic status factors alone to determine 
who should receive treatment and in what 
form. The discrepancy between acceptance 
and outcome data underscores this fact. 
Equally, it is unreasonable to assume that 
alternative forms of treatment do disservice 
to those not receiving individual therapy. 
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The first proposes that acute schizophrenics, 
as they become chronic, progressively focus 
attention more narrowly in order to avoid the 
aversive psychological disorganization pro- 
duced by normal ranges of cue utilization 
(e.g., Broen & Storms’ [1967; Broen, 1968] 
response interference theory; Johannsen, 
Friedman, & Liccione, 1964; Mednick, 1967). 
The second hypothesis is that this narrowing 
of attention is a reaction to the impoverished, 
redundant environment of the hospital to 
which the chronics are exposed for long pe- 
riods of time. This change then is not intrinsic 
to the course of the psychosis (Pishkin, Smith, 
& Liebowitz, 1962; Silverman, Berg, & Kan- 
tor, 1965). 

Data consistent with both the course-of- 
psychosis and effects-of-institutionalization 
hypotheses have been reported (Johannsen & 
O’Connell, 1965; Silverman et al., 1965) 
but unequivocal conclusions are not pos- 
sible. The only generalization which may be 
made from this rather large literature (cf. 
Schooler & Feldman, 1967) is that acute and 
chronic schizophrenics tend to differ on a 
number of behavioral measures. The reasons 
for this rather restricted conclusion are dealt 
with in the following sections. 


RESEARCH DESIGN IN CHRONICITY STUDIES 


Silverman (1964a, 1967), for one, has sug- 
gested that the variable of chronicity is crucial 
to the resolution of apparently inconsistent 
empirical findings and to the development of 
appropriate theories of perceptual, cognitive. 
and autonomic functioning in schizophrenia. 
The studies on which his analysis was based, 
and those subsequent to his reports (cf. 
Broen, 1968; Neale & Cromwell, 1970) have 
used cross-sectional designs. Common to these 
studies are (a) the sampling of an inpatient 
population at a single point in time: (5) the 
exclusion of subjects likely to introduce ex- 
traneous variance or to increase variability: ? 
(c) the use of a time-bound metric, for ex- 
ample, length of illness, to classify subjects as 


3 These typically include organic brain disease, al- 
coholism, recent electroconvulsive therapy and un- 
cooperativeness, The biases introduced by these 
exclusions, especially the last, are not unique to this 
area (cf. Chapman, 1963; Klein & Spohn, 1964). 
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acute or chronic. The two groups differ sub- 
stantially in chronicity: The shortest time ill 
chronic subjects exceed the longest time ill 
acutes by one or more years. (d) The уап- 
ables of premorbid adjustment and/or para- 
noid symptomatology are ignored in selecting 
subjects. While especially true of studies prior 
to Silverman's (1964a) review, most later 
studies sample only one of these two variables 
(e.g., Yates & Korboot, 1970). (е) Group dif- 
ferences on the dependent variable(s) are at- 
tributed to the selection principle explicitly 
employed in the composition of the groups, 
that is, chronicity. + 

А number of aspects of this paradigm limit 
the conclusions which may be drawn from the 
empirical findings. As with any static-group 
comparison (Campbell & Stanley, 1963), the 
attribution of the source of the behavioral dif- 
ference is questionable. The groups may also 
differ on other variables whose relationships to 
the dependent behavioral variable are to vary- 
ing degrees not known (McGhie, 1970). Fur- 
ther, as will be demonstrated below, acute and 
chronic groups do differ on variables of known 
importance: life history and symptom char- 
acteristics. Studies could be conducted to de- 
tect potentially confounding relationships and 
their possible effects could be “controlled” by 
means of subject selection or statistical tech- 
niques. This would be of limited utility, how- 
ever, for the fundamental source of the uncer” 
tainty of the meaning of behavioral differences 
between acute and chronic subjects is the use 
of static-group designs to examine develop- 
mental hypotheses. The inappropriateness : 
these designs may be best appreciated by tne 
consideration of typical definitions of chro- 
nicity. 


Construct OF CHRONICITY 


Within the time-linked conception employed 
by most psychological investigators, с E 
nicity refers to either length of time elap: th 
since the onset of schizophrenia, that ouem ; 
of illness, or to the total length of hospita А 
tion since onset (Cromwell, 1972). ch ad 
cases, onset is defined as the time of firs 
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4 It is unclear in the literature uper 
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ACUTE AND CHRONIC SCHIZOPHRENICS 


mission to a hospital, an empirically useful 
but far from ideal criterion (Neale & Crom- 
well, 1970). 

An acute, or more accurately, early-term 
schizophrenic then is (a) a recent first ad- 
mission, (5) a first admission with only a brief 
hospitalization at the time of the study, or 
(c) a multiple-admissions patient with a small 
total length of hospitalization. Chronic or 
long-term subjects are those who (a) were 
first hospitalized a substantial time prior to 
the study and who may or may not have sub- 
Stantial length of hospitalization or (5) have 
had long length of hospitalization. Although 
Investigators have used the several meanings 
and measures of chronicity interchangeably, 
their empirical equivalence has not been dem- 
Onstrated, 

Before the widespread introduction of phar- 
Macological and other treatment changes in 
the mid-1950s, length of illness and length 
of hospitalization were very likely substan- 
tially correlated, This is suggested by Brown's 
(1960) summary of epidemiological survey of 
Patients admitted between 1900 and 1951. 
For first admission cohorts, there was a nega- 
tive linear relationship between probability of 
discharge and length of continuous hospital- 
zation during the first two years after ad- 
раан, In the third year of hospitalization, 
ISCharge rates plummeted markedly and be- 
‘ame stabilized at а low level after three years 
ПВ of hospitalization. During this time, 
most hospitalized schizophrenics were first 

Missions and computing chronicity in terms 

Months since first admission or months of 
o Pitalization since first admission would lead 
v оз! identical numerical values. How- 

oe! the treatment careers of schizophrenics 
Es. Changed markedly since the time period 

i sidered in Brown's study (Smith, Bower, & 
termitte 1965; Yolles & Kramer, ped te 
ing, Pad patienthood (Friedman, Rem life 

inko, 1966), the more typica 
of schizophrenics currently, has almost 
Y attenuated the correlation between 
а Of illness and length of hospitalization. 
S may account for the differences in find- 
etween procedural similar studies 
define chronicity differently (e. Kopf- 
Neale, 1971; Strauss & Parwatikar °). 
› When studies which define chronicity 
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differently show convergences in findings, their 
interpretation still remains unclear. Long- 
term hospitalized subjects and long-term ill 
but only intermittently hospitalized subjects 
differ in significant ways. The latter become 
rehospitalized as a function of psychosocial 
characteristics and their failure to continue 
taking ataractic drugs (Crumpton, 1968; 
Paul, 1969; Rosen, Englehardt, Freedman, & 
Margolis, 1968). The continuously hospital- 
ized subjects, among other things, have not 
responded to phenothiazines more than mini- 
mally. 

Most studies treat chronicity as a typology, 
although it is measured dimensionally. Hence 
it becomes important to select an appropriate 
cutting point for discriminating between early- 
term and long-term subjects, On the basis of 
Brown’s (1960) work, it has been recom- 
mended that two years length of hospitaliza- 
tion be used as the maximum for inclusion in 
the early-term group, and at least three years 
length of hospitalization be used as the mini- 
mum for the long-term group. Subjects in the 
more ambiguous two to three year length of 
hospitalization range are excluded (Neale & 
Cromwell, 1970; Neale, Held, & Cromwell, 
1969). The current validity of Brown’s (1960) 
conclusions may be questioned. Although the 
data are not as extensive for the post-1950s, it 
appears that six months or one year of con- 
tinuous hospitalization are currently more ac- 
curate inflection points in probability of dis- 
charge curves (Gorwitz, Bahn, Klee, & Solo- 
mon, 1966; Yolles & Kramer, 1969). In the 
absence of appropriate epidemiological data 
for current first admission and resident popu- 
lations, it is problematic to determine a suit- 
able length of hospitalization criterion for the 
identification of chronicity. m 

Silverman's (1964a) alternative criteria 
(early term < 3 years length of hospitaliza- 
tion, long term > 6 years length of hospital- 
ization) were based on his analysis of studies 
which suggested that these criteria identified 
patient subgroups which were quite different 
on perceptual and cognitive measures. These 
groups also differed in other important ways 
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(e.g., phenothiazine treatment status, premor- 
mid adjustment, and paranoid status) and the 
theoretical propositions associated with these 
chronicity criteria have subsequently had a 
mixed validity history (Kopístein & Neale, 
1971; Neale & Cromwell, 1970; Neale, Davis, 
& Cromwell, 1971). The meaningfulness of 
these criteria are thus also uncertain. 

The use of cutting points to partition a con- 
tinuous variable into discrete categories aí- 
fects the likelihood of detecting relationships 
with other variables and the nature of the ef- 
fects detected. If length of illness or length of 
hospitalization measures are arbitrarily di- 
chotomized into early-term and long-term 
categories, the statistical power of studies is 
reduced; but if subjects of intermediate chro- 
nicity (e.g., 3 < length of hospitalization < 6 
years) are discarded, the likelihood of finding 
effects may be increased. In either case the 
use of dichotomous chronicity measures pre- 
cludes the detection of linear or curvilinear re- 
lationships with dependent variables. Such 
relationships are easily detectable when ap- 
propriate correlational techniques are applied 
to a continuous measure of chronicity like 
length of illness or length of hospitalization. 

Length of hospitalization appears to be the 
preferred measure of chronicity in the recent 
literature. Like length of illness, length of hos- 
pitalization is an easily collected datum; it is 
obtained from hospital records. However, for 
subjects with multiple admissions at several 
hospitals, length of illness is more easily and 
reliably ascertained as only the date of first 
admission need be determined. The preference 
for length of hospitalization would appear to 
have three sources (a) the historical role of 
length of hospitalization and discharge cri- 
teria as measures of prognosis and outcome 
in schizophrenia (Brown, 1960; Yolles & 
Kramer, 1969); (5) a belief that institution- 
alizational itself produces deterioration in 
mental patients (cf. Williams"); and (c) a 
provocative report by Silverman et al. (1965). 

In this oft-cited study it was found that 
early-term and long-term  institutionalized 
felons differed on the Titchener Circles illu- 
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sion and on the Pettigrew Category Width 
tasks in the same way that early-term and 
long-term schizophrenics were expected to 
differ. This appeared to demonstrate effects of 
institutionalization independently of schizo- 
phrenic psychopathology. However, in the 
study of schizophrenics to which the prisoners’ 
data were compared (Silverman, 1964b) sig- 
nificant early-term-long-term differences were 
found only for paranoids, were post hoc, and 
were clear only on the task for which predicted 
paranoid-nonparanoid differences were not 
found. At best, these unreplicated findings are 
applicable to only some schizophrenics—the 
paranoids. Further, in a study using well- 
established reaction time and conceptual mea- 
sures Best ° found early-term-long-term differ- 
ences for schizophrenics but not for matched 
prisoner controls. 

Johannsen and O'Connell (1965) directly 
questioned the importance of the institutional- 
ization variable in a reanalysis of an earlier 
study (Johannsen et al, 1964). In the 
latter study, in which chronicity was treated 
as the continuous variable which it is, 2 
number of visual perception measures were 
found to be related to length of illness. With 
the exception of the length of illness < 6 
months group, all subjects had spent equiva- 
lent proportions of time since first admission 
in hospitals. If institutionalization itself were 
an important determinant of schizophrenics’ 
behavior, they reasoned, high proportion of 
time and low proportion of time hospitalized 
subjects should differ on perceptual tasks. 
Only one significant difference was found in 
contrast with a number of differences assocl- 
ated with length of illness. i 

Johannsen's work has been cited as а es 
for selecting early-term and long-term subjec i 
using length of illness rather than ng 5 
hospitalization (Neale & Cromwell, 1970): 


TR. H. Best. Cognitive impairment. in T 
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longed exposure to impoverished environmental i 
ditions. (Doctoral dissertation, Southern DG 101 
versity) Ann Arbor, Mich.: University Mic 
1968, No. 69-1739. 
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Given the vagaries of discharge and readmis- 
sion policies (Cumming, 1963; Mosher & 
Feinsilver"), the use of length of illness 
would appear to have much to recommend it- 
self. Of course, if only two groups of subjects 
are selected, the problem of appropriate cut- 
ting points still remains. However, further ex- 
amination of the Johannsen study suggests 
that the length of illness findings may reflect 
the operation of a subject sampling variable 
known to be related to behavioral variation in 
Schizophrenia-paranoid v nonparanoid 
Symptomatology, A secondary analysis by this 
Writer revealed a significant linear decrease in 
or ргороніоп of paranoid subjects as length 
Re hess Increased (linear trend X* = 6.5171, 
/ = 1, p< 02; residual X%, ns). A similar 
pu was found in a secondary analysis of 
E and Zigler’s (1971) recent study of the 
E Tonship i between premorbid adjustment 
с Paranoid symptomatology. When this 
е divided their small sample (N — 37) 
irst-admission and multiply-admitted 
Pos there was a significant trend (X? — 
«eon! üf 1, р < .05) for the latter to be 

5 Оеп paranoid. 
ай" Pothesis other than sampling bias 
older и for these findings. Perhaps 
p. »jects are less often paranoid than 
m T subjects. Also, paranoid symptoms 
C disappeared over the course of ше 
Detieness Pss the repeated weed 
'orzio & ü ШЕ a ct us 
апа Ziel Hed und, 1968). simile ith Apes 
ie ve ers multiply-admitted subjects may 
* been paranoid at first admission." The 


Crucial ; > i a ; 
p issue is: Do the paranoid patients or 
Chro Paranoid symptoms disappear from 
nic or long-term populations? This is a 


Questi 
үш to which we shall return. . 
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ences may be found in the literature. Payne 
(1966), for example, observed that early-term 
subjects were either overinclusive or retarded 
while long-term subjects were primarily re- 
tarded, and speculated that overinclusion 
might be a good prognostic sign. Hence early- 
term-long-term differences on measures of 
overinclusion could reflect the loss of overin- 
clusive subjects from long-term groups rather 
than any cognitive change (cf. also Draguns, 
1963). A small sample follow-up study of 
mental patients, all of whom were not schizo- 
phrenic, provided some tentative support for 
this hypothesis (Payne, 1968). The changing- 
sample-composition hypothesis is thus not a 
new one. It is, however, an unexplored one 
and has substantial significance for the in- 
terpretation of cross-sectional comparison tests 
of the developmental course-of-psychosis and 
effects-of-institutionalization hypotheses. 


SOURCES OF SAMPLING BIAS IN STUDIES 
CoMPARING EARLY-TERM WITH 
LONG-TERM SCHIZOPHRENICS 


Two sources of sampling bias in cross-sec- 
tional studies of early-term and long-term 
schizophrenics (premorbid adjustment and 
paranoid symptomatology) are parameters of 
substantial import in the description of vari- 
ability in schizophrenics’ behavior and are 
dealt with first. Subsequently, this section 
deals with other sources of sampling bias in 


these designs. 


Premorbid Adjustment 
Studies of early-term and long-term schizo- 
phrenics must sample institutionalized popula- 
tions if chronicity is assessed using length of 
hospitalization. Although the use of length of 
illness measures does not require the sampling 
of only inpatient populations, studies have 
been restricted to them. It is well documented 
that premorbid adjustment predicts behaviora 
differences among early-term schizophrenics 
(Higgins, 1969). In general, poor premorbid 
subjects manifest at first admission greater de- 
grees of psychological deficit than do good 
premorbid subjects (Higgins, 1969; Neale and 
Cromwell, 1970). Good premorbid subjects 
ubstantially higher probability of dis- 
significantly shorter durations of hos- 
ion, and 
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are less likely to be readmitted to inpatient 
facilities (Farina & Webb, 1956; Rosen, 
Klein, & Gittleman-Klein, 1971; Turner & 
Zabo, 1968). Thus good premorbids are un- 
derrepresented in long-term samples. Parsi- 
mony would suggest that early-term-long-term 
differences be interpreted in terms of differ- 
ences in premorbid adjustment, rather than in 
terms of differences or changes associated with 
chronicity. 

Investigators aware of the premorbid ad- 
justment bias in randomly selected long-term 
subjects have suggested a matching strategy 
(Neale & Cromwell, 1970). Selecting subjects 
in terms of premorbid adjustment as well as 
chronicity criteria would assure the numerical 
equivalence of the proportion of good pre- 
morbids in early-term and long-term groups. 
This procedure involves the tacit assumption 
that the less frequently found long-term goods 
are a representative sample of good premorbid 
subjects. The good premorbid long-term 
schizophrenic is deviant with respect to out- 
come, however. Is it not likely that he was 
also deviant behaviorally at admission or dur- 
ing the early-term period? The answer to this 
question is not known; behavioral studies of 
early-term good premorbids who experience 
different long-term outcomes could not be 
found. 


Paranoid Symptomatology 


It has been pointed out that paranoid sub- 
jects are less often found in chronic groups. 
Among early-term subjects, paranoids tend 
to manifest less psychological deficit than 
nonparanoid schizophrenics (Broen, 1968). 
Whether the sampling bias position applied 
to the premorbid adjustment variable is also 
applicable here depends on the source of the 
underrepresentation of paranoids in long-term 
groups. As was asked earlier, do paranoid pa- 
tients or paranoid symptoms disappear? 

The answer to this question is uncertain. 
Clinical lore supports either view (Bleuler, 
1950; Neale and Cromwell, 1970; Robins 
and Guze, 1970). It is clear that among read- 
mitted patients fewer manifest paranoid symp- 
toms on second admission (e.g., Morgan et 
al., 1968), but these studies do not indicate 
what proportion of paranoid and nonparanoid 
first admissions are rehospitalized. 
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There are suggestive but nonconclusive 
findings supporting the hypothesis that para- 
noid patients disappear from hospitalized sam- 
ples. First, Sommer and Witney (1961), dis- 
cussing the “chain of chronicity,” noted that 
twice as many nonparanoid as paranoid pa- 
tients were transferred from short-term inten- 
sive treatment to chronic care wards. Para- 
noids were more likely to be directly dis- 
charged. The second source of information was 
a follow-up of Stenn's '' Veterans Administra- 
tion subjects. Three years aíter the initial 
study of these early-term (length of hospital- 
ization < 3 years) subjects, we examined their 
rates of discharge and readmission. Of the 29 
paranoid subjects whose records were located, 
45% were not in hospitals at any point during 
the 3-year follow-up, but 73% of the non- 
paranoids had been either readmitted or not 
discharged during the same span of time (s 
= 2.0737, p < 05). This difference in the 
incidence of paranoids in what are now long 
term subjects supports the loss-of-patients hy- 
pothesis. This conclusion must be viewed as 
tentative as the numbers are small and it is 
possible, although unlikely, that the paranoids 
were rehospitalized in other than Veterans Ad- 
ministration settings. It should also be noted 
that the loss of paranoid symptoms hypothesis 
may also be valid; the initially paranoid sub- 
jects who were rehospitalized may have been 
nonparanoid at readmission. Unfortunately; 
readmission diagnoses were not available. 
Intermittent Patient hood 

Baldly stated, samples of hospitalized long 
term subjects are extremely unrepresentative 
of the long-term schizophrenic population 
whether defined by length of illness or length 
of hospitalization criteria. Drug treatment has 
markedly facilitated the development of treat- 
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Dinitz, 1967). А major determinant of 
Whether a patient will become rehospitalized 
and hence be a potential subject, is whether 
Prescribed ataractic drugs continue to be 
taken (Crumpton, 1968). Rehospitalized sub- 
Jects are likely to be those who have volun- 
tarily discontinued drug regimens and it 
appears that drug discontinuance results in re- 
hospitalization for essentially only poor pre- 
Morbid subjects. The use of a placebo instead 
of an active drug does not affect the hospital- 
ization rates of ‘“nonhospitalization prone" 
(i.e, good premorbid) subjects (Freedman, 
osen, Englehardt, & Margolis, 1967: Rosen 
et al., 1968), | 
Sead factors contribute to nonrepresenta- 
| 5s of hospitalized long-term schizo- 
Ene and the uninterpretability of early- 
long-term differences: Rehospitalized or 
Ec hospitalized subjects are not in- 
Bud Рин, Readmission may de- 
Es ew on degree of psychopathology or on 
COS in symptomatic behavior than on 
Ses Spital living arrangements and other 
cial variables (Brown, Carstairs, & Topping, 
У poe. 1963; Mendel & Rappaport, 
ен Whether cognitive deficits аге mani- 
> “© may depend less on symptomatic status 
B Se educational attainment and motiva- 
On for discharge (Braginsky, Braginsky, & 
ie 1969; Schwartz. 1967). The rapidly de- 
ent E and widely implemented nonhospi tal 
represent, programs virtually p 
ор ê ntativeness in resident long-term schiz- 
ue samples. 
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Consider a hypothetical study comparing 
cognitive functioning in early-term and longe 
term drug-treated, hospitalized, poor pemoc 
bid subjects. Will differences be found? Prob- 
ably not if long-term subjects are selected 
from recently admitted patient populations 
for both early-term and long-term groups 
would be responsive to phenothiazine treat- 
ment. If, however, long-term subjects were 
selected from populations characterized by 
long, uninterrupted hospitalizations, differ- 
ences would probably be observed. One would 
be comparing a maximally drug responsive 
group, poor early terms, with the residual, 
nonresponsive members of a poor premorbid 
cohort, the long terms. The observed early- 
term-long-term differences in the latter case, 
however, may reflect differences between drug- 
responsive and nonresponsive poors which ex- 
isted during the early-term period. 


CONCLUSIONS 


Despite the arbitrary cutting points used 
to differentiate early-term and long-term 
schizophrenics, cross-sectional studies have re- 
vealed behavioral differences between these 
groups. Comparisons of hospitalized early- 
term and long-term subjects, however, are com- 
parisons of differently heterogeneous groups: 
paranoid and nonparanoid, good and poor pre- 
morbid, drug responsive and nonresponsive 
subjects are compared with primarily nonpara- 
noid, primarily poor premorbid, drug respon- 
sive, and nonresponsive subjects, Early-term- 
long-term differences are also affected by the 
selective discharge, retention, and readmission 
of schizophrenics associated with social, psy- 
chological, and psychopharmacological vari- 
ables. Thus the conclusion that these behav- 
ioral differences reflect changes in psychologi- 
cal functioning is not warranted by the data. 
They are equally consistent with a more par- 
simonious changing-sample-composition hy- 
pothesis which does not require the inference 
of intraindividual change to account for inter- 
individual differences. The study of changes in 
psychological processes during the course of 
this psychosis requires the longitudinal inves- 
tigation of first admission cohorts, not the 
cross-sectional comparison of inpatient sam- 


ples. 
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Theory and research on the 


viewed. Theories stressing power centralization are 
sharing of decision-making power. 
and group decision 


temporary emphasis on the 
process approach to 
previous research and directs future 


mediated by 
and individual motives. 


“One overriding impression conveyed by 
surveying the literature of the 1960s, in con- 
trast to the preceding two decades, is the re- 
direction of interest in leadership processes 
such as power... relationships [Hollander 
& Julian, 1969, p. 388 |." This paper contrasts 
theoretical emphases оп power centralization 
and power sharing and presents an integrative 
el focusing on interactive organizational 
proce: situation. determinants of power re- 
lationships and organizational behavior. The 
model is presented in Figure 1. 

The general approach it represents has two 
basic elements. First, organizational develop- 
is characterized as a process system 


mod 


ment 
(Katz & Kahn. 
outputs through. th 
cal processes and W 
back into the system 
Second. the relations 
classes of parameters are n е 
vidual and situational ee 
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power relation- 
UAE ristics and 


1966) with inputs generating 
e mediation of psychologi- 
ith these outputs feeding 
to affect further inputs. 
hips between the three 
mediated by indi- 
The major 
xamined in 


intervening 


{ decision characte 


ships. outputs 01 e us en 
organizational effectiveness. and 1 bic 

$ i gu Li € 
variables associated with the organi ali E 
me, dic and individua 


context. decision-making tasks. 
should be addressed to 
p-ycholozs- Ohio 
Avenue. Colum 


reprints 
Department 
i01. C West 17th 


| Requests for 
Michiel T Wood. 
State University - 
bus, Ohio, 43210 


ol 


distribution of power in organizations are Ye- 


power relationships 
study. 
’ tionships as intervening processes in organ 
determinants and outcomes. The group decision process is characterized as a 
multiphased process, in which participation, 
interaction dynamics affect power relationships. Linkages betw 
processes and power, and between power 
characteristics of the organizational environment, decision tasks, 


contrasted with the con- 
A situational- 
aking integrates 
This approach views power ri 
ational development, with multiple 


mutiple bases of power, and 
n decision 
and organizational consequences, are 


differences in motivation, Previous models of 
personnel decisions (Dunnette, 1966), and 
leadership effectiveness (Campbell, Dunnette, 
Lawler, & Weick, 1970; Fiedler, 1967) offer 
precedents for a situational-process approach. 


THEORETICAL Bases OF POWER SHARING ANP 
Group DECISION MAKING 

Early writings on the management of busi- 
ness and industrial y ^l emphasized 
the centralization of power in top manage- 
ment levels. Given the assumptions that the 
dominant worker motivation toward security 
could be fulfilled through economic reward, 
and that organizational efficiency was maxi- 
mized through job simplification, centraliza- 
tion was in the mutual interest of the worker 
and the firm (Taylor, 1947). Functional view- 
points dictated specialists in planning anc 
coordination (Gulick, 1937), and princip es 
of management and control were oriente 
toward formal, hierarchical organization struc- 


tures. Fayol (1949) argued for unity of com- 
mand and scalar power chains; employee? 


should receive orders from only one superio! 


who controls homogeneous sets of activities: 
and excepting emergency situations, commun! 
cations must travel up the hierarchy and be 
relayed through management down to the 


Jower levels. | u 

Bureaucralle theory ея 
1 management posi ў 
and distinct author! 
is incol 


(Weber, 1960) vested 
s structured by 
ty gradations. 


ower il : 
npatible with 


formal rules 


а " -4 Wwe 
Group decision making W4 
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E ё 2 Intervening 
ae — * eee Psychological * —— Organizational 
‘ganizationa 
nputs Proce Consequences 
LG ET =" r З 
roup decision making 1. Power relationships 1. Decision characteristics 


a. Participation in phases 
b. Multiple power inputs 
с. Group interaction 


2. Other development programs 


*Situational and Individual Mediators 


Fic. 1, A situational-process approach to or 


these structural positions. Planning boards 
and committees were implemented only in 
Special circumstances since they violated the 
Unity of command principle and brought peo- 
Y ple from different authority levels together to 
render decisions, These administrative theories 
Of organization and their implied power rela- 
rains lacked empirical support. Today 
tg ШЕ often treated as “straw men” (Ben- 
modif 70) ог their assumptions are greatly 
Udy pe (Gouldner, 1954; Merton, 1963; 

| У, 1966). 
Tow: organization 5 | 
hese classical assumptions in 


theorists have aban- 
favor of 
multi- 


com- 


| 

۸ 

\ 1 P; ck 

| ао" Need Hierarchy 

ш Maslow’s (1954) “need hierarchy” concept 


ees the primary role of economic moti- 

Works ; This position assumes that es 
eray order to satisfy needs: (0) needs are 

| lowest р У prepotent and categorized from 
cial = highest level as physiological, safety, 
teq e and self-actualization: 
Ower Te, do not motivate behavior: We" © 
evel need is satisfied, the immediately 


(c) satis- 
when а 


i 


2. Other process 
a. Affiliation 
b. Trust 


a. Quality 
b. Acceptance 
c. Risk 


2. System effectiveness 
a. Role attitudes 
b. Role performance 
c. Other criteria 


ational characteristics 


ual motives 


а 


vanizational power relationships. 


higher level of needs becomes the prime moti- 
vating force, Power centralization is incom- | 
patible with the need hierarchy hypothesis. 
“The philosophy of management by direction 
and control, whether hard or soft, is inade- 
quate to motivate because the human needs 
on which this approach relies are relatively 
unimportant motivators of behavior in our 
society today | McGregor, 1960, p. 42]." The 
self-control afforded participants in group 
decision making serves to satisfy higher level 
human needs. 

Group decision making and other com- 
ponents of the human relations approach to 
organizational design rely heavily on Mas- 
low’s contribution. However, empirical evi- 
for the need hierarchy hypothesis is 
While managerial research has 
ance of higher level 
portunities for their 


dence 
equivocal. 
demonstrated the import 


needs and differential op 
fulfillment (Porter, 1964; Porter & Lawler, 


1968), field studies have supported only a 
two-step hierarchy of motives, with the social, 
ego, and self-actualization motives comprising 
the higher level (Hall & Nougaim, 1968; 
Lawler & Suttle, 1972). 


Individual-Organization Integration 


While group decision making may be in- 
strumental to the fulfillment of several mo- 
tives, Argyris (1957, 1964) principal focus is 
on its utility for seli-actualization, The de- 
mands placed on people by the classical 


Arg yris 
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bureaucratic form of organization tend to 
thwart any movement toward self-actualiza- 
tion, which Argyris views as a complex motive 
comprised of: psychological success, selí- 
awareness and control, independence, activity, 
personality growth, high aspiration levels, 
deep interests, and long-time perspectives. 
Where tasks are simplified, chains of com- 
mand are pyramidal and rigid, spans of con- 
trol are narrow, and so forth, the worker must 

- behave in an environment where he has very 
little power, may develop a few shallow abili- 
ties and a short-time perspective, and is 
expected to be passive and dependent. Argyris 
recommends group decision making as one of 
several interdependent paths (including lab- 
oratory education in interpersonal compe- 
tence, job enrichment, and reality-centered 
leadership) to the resolution of individual- 
organization conflict. When group decision 
making involves issues relevant to partici- 
pants' work roles, it acts as a mechanism for 
increasing responsibility, autonomy, апа 
achievement. Members can view the system 
in broader perspective, use their interaction 
and creative skills, and develop multidirec- 
tional power relationships. 


Power Continuum and Expectations 
Blake and Mouton (1961) have studied a 


"power spectrum" ranging from complete 
leader power (1/0) through collaboration 


(.5/.5) to complete subordinate power (0/1). 
Their research showed that as dyadic power 
relationships went from 1/0 to 0/1, (a) lead- 
ers’ and subordinates’ perceptions of responsi- 
bility changed curvilinearly with both maxima 
occurring at .5/.5; (b) subordinates’ satis- 
faction increased linearly, and leaders’ satis- 
faction decreased linearly. The optimum point 
on the spectrum, in terms of maximizing the 
felt responsibility of both leaders and subordi- 
nates and of equating their satisfaction, was a 
condition of power equalization. 

The power expectation hypothesis suggests 
that affective responses should be a function 
of differences between power expectations and 
the extent to which these expectations are 
met. Horowitz (1964) varied эш, Ет 
actual power relationships and paren: Ше 
effects of four comparisons on subordinates 


affect: (a) .5/.8 superior expectation, 1/0 
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subordinate expectation, and .5/.5 actual dis- 
tribution; (0) .5/.5, .5/.5, and .5/.5; (с) 
1/0, 1/0, and 1/0; and (d) 1/0, .5/.5, and 
1/0. Liking for the superior was greatest 
when 1/0 expectations were met, second high- 
est when subordinates expected no power and 
exercised equal power, and lowest when they 
expected equal power and exercised none. The 
optimal discrepancy between expected and 
actual power remains an empirical question. 
Likert (1961) suggests that this discrepancy 
should be slightly positive, That is, the sub- 
ordinate should exercise more power than he 
expected, but not so much as to exceed his 
decision-making capabilities, 


Likert’s Participative Group Management 


Likert (1961, 1967) considers group deci- 
sion making an essential feature of manage- 
ment. His early research indicated that effec- 
tive organizational units, as contrasted with 
ineffective units, (a) had employee-centered 
rather than job-centered leaders; (b) main- 
tained general rather than close supervision; 
and (c) allowed freedom for the work group 
to set its own goals and pace. Satisfied 
workers valued group decision making and 
reported that their supervisors asked for their 
ideas and frequently acted on them, Power 
sharing is part of Likert's leadership principle 
of "supportive relationships," which underlies 
effective communication and motive fulfill- 
ment. He has presented data from business; 
industrial, voluntary, and government organi- 
zations which support the efficacy of his “SYS” 
tem 4” or participative group form of organi- 
zation (Likert, 1967). 


Tannenbaum's Influence Pie 


Tannenbaum (1968) has presented one of 
the more provocative theoretical innovations 
concerning power sharing. He argues that the 
total amount of power in an organization nee 
not be conceptualized as a fixed quantity, 
where changes in power relationships necessi- 
tate a give-and-take process. R 

The fixed pie assumption has prevailed in 
previous research. The expectation model con- 
siders the consequences of varying relative 
proportions of a fixed power pie allocated to 
two persons or organizational levels. Blake 
and Mouton's (1961) power equalization, or 
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3/5 distribution, involves each person exer- 
csing half of the total power. Mechanic 
(1962) noted that decision strategies involve 
value judgments on the merits of retaining or 
delegating power. Tannenbaum, however, sug- 
Rests that total group or organizational power 
'S variable and that proper management prac- 
tices can expand the pie, 

Nonsupervisory personnel in 31 depart- 
ments of a service organization indicated on 
Independent rating scales the influence that 
they, their supervisors, and department mana- 
Sers exercised over department activities. 
Respondents attributed greater influence to 
Successively higher levels. However, for the 
top One-third of the departments in terms of 
Productivity, as contrasted with the lower 
third, influence was higher at all three or- 
Sanizational levels (Likert, 1961, pp. 55-56). 
"IDport has also been found for the hypothe- 
RIS Of expanded influence pies in Leagues of 
Vomen Voters (Tannenbaum, 1961), labor 
unions (Tannenbaum & Kahn, 1958), insur- 
ünce agencies ( Bowers, 1964), utility plants 
aon & Hoffman, 1960), and a factory in 

'g0slavia (Zupanov & Tannenbaum, 1968). 

Mechanisms of influence pie expansion are 
pe to Likert's interaction-influence sys- 
aa : hen a leader shares power, his own 
With f end increase through his interaction 
buts 0 lowers and acceptance of their contri- 
ii ns, which build a base for subordinates 

Ponsiveness to the leaders’ structuring be- 
Aviors. If the leader maintains influence over 
er Periors (Pelz, 1952), his group can in- 
Mel Its total power. This theoretical posi- 
Th ris assumes that subordinates infiuence 
ш cision making is instrumental to decision 

P mentation at their organizational level. 
adaj cation of these mechanisms requires 

Onal data, 


Criticisms or POWER SHARING 
Although many contemporary organization 
t ew. endorse the sharing of social power, 
Cont, €sis is not universally accepted. Several 

“Mporary writers have tried to refute or 


Mo, = 
[ > Gili 
( Te the power-sharing thesis. Gilman 
lim; ^)» arguing that power sharing fails to 

nit 3 advo- 


Workers’ power-seeking behavior, 
middle-of-the-road approach 
"is exercised through both formal 


where 
and 


informal organization processes and is granted 
from below as well as delegated from above. 
Leavitt (1965) concluded that the power- 
sharing propositions seem essentially correct 
but they have been overgeneralized and lack 
empirical support. Much of the power-sharing 
research is inadequate either in its specifica- 
tion of power-sharing techniques or its evalu- 
ation criteria. Sensitivity training studies 
elaborate intragroup processes but rarely in- 
voke adequate criteria (Campbell & Dun- 
nette, 1968). Neither early field experiments 
on group decision making (Coch & French, 
1948; Morse & Reimer, 1956) nor research 
conducted in a correlational paradigm (Likert, 
1961) has isolated power sharing per se as a 
cause of organizational effectiveness (Wood, 
1970, Ch. 1). 

Some concomitants of power sharing may 
also be incompatible with its goals. Power 
sharing may generate cohesiveness and loy- 
alty and inhibit creativity. The unlimited 
communication of information across levels 
may exceed the information-filtering and se- 
lectivity processes in the organization. Unre- 
strained power sharing may also ignore indi- 
vidual differences in the motivation to exer- 
cise power. 

Strauss (1963) has presented the most 
comprehensive critique. His basic points can 
be summarized as follows: (a) The assump- 
tions about human motivation on which 
power sharing rests are inaccurate. Such as- 
sumptions are based on value judgments 
made in academia and rarely converge with 
workers’ values. In particular, economic moti- 
vation is underemphasized. Recent discussions 
of the instrumentalities of economic reward 
for higher level need fulfillment weaken this 
criticism (Lawler, 1971; Opsahl & Dunnette, 
1966). However, Strauss suggests that we 
underplay job satisfaction and accelerate 
automation to provide more leisure time in 
which man can strive to self-actualize. (5) 
Power sharing ignores organizational costs. 
Gains attributable to power sharing may not 
compensate for the costs of its implementa- 
tion. (c) Power-sharing programs produce 
dysfunctional consequences, specifically: (i) 
Resulting cohesiveness may work against or- 
ganizational goals; (ii) an initial exercise of 


power may generate expectations for continu- 
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ous participation, which management may not 
be able to meet; (iii) participants who do not 
exercise power, for example, who have their 
ideas rejected, may become hostile and ali- 
enated; (iv) group decision making can be 
frustrating and generate decisions of question- 
able quality; (v) group decision making can 
readily become a manipulative tool of man- 
agement to attain its ends. 

Many of these criticisms seem valid. Yet, 
it would be as premature to totally subscribe 
to them as it would be to adopt power shar- 
ing as an organizational panacea. Resolution 
of theoretical conflict concerning power rela- 
tionships and group decision making requires 
research on the nature and consequences of 
group decision-making processes and stipula- 
tion of boundary conditions for power shar- 
ing. 


SrrUATIONAL-PROCESS APPROACH 
Group Decision-Making Process 


Group decision making has been the most 
extensively studied means of sharing power. 
However, neither the operational referents of 
this process nor the components responsible 
for its effects are clearly defined. The ap- 
proach diagrammed in Figure 1 suggests con- 
sideration of three elements of the process: 
(a) participation in phases of decision mak- 
ing, (b) multiple power inputs, and (c) inter- 
personal interaction in the process. 

Participation im decision process phases. 
Researchers have defined participation as: 
joint decision making by two or more parties: 
decision making by subordinates: the amount 
of talking or other demonstrative behaviors: 
and the amount of influence exercised." The 
assumption of participation—influence iso- 
morphism has been particularly prevalent in 
research on decision-making styles. Tannen- 
baum and Schmidt (1958) discussed seven 
styles, ranging from the leader's making a 
decision and announcing it, to his permitting 
subordinates to contribute within leader-de- 
fined limits. These styles varied (concep- 
tually) along a leader-power continuum, Hel- 
ler and Yukl (1969) also placed delegative, 


J. Anderson. The effect of participation and 
consideration on subordinate job attitudes. Unpub- 
lished manuscript, University of Illinois, 1969. 
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consultative, and joint decision-making styles 
along an inlluence continuum. Such assump- 
tions will be valid, however, only to the ex- 
tent that manipulations of participation re- 
late to the exercise of influence and to mem- 
bers’ perceptions of influence relationships. 

An alternative approach conceptualizes 
group decision making as a process comprised 
of phases in which organizational members 
may have an opportunity to take part (par- 
ticipate), Based on theories of executive deci- 
sion making (Bass, 1965; Cyert & March, 
1963; March & Simon, 1958) and on empiri- 
cal assessments of problem-solving interactions 
(Bales & Strodtbeck, 1951), a tentative 
formulation of the process incorporates three 
phases—generation, evaluation, and choice of 
alternative resolutions of a decision issue. 
Variations of group decision making involve 
complete-process participation and forms of 
group decision making where subordinates 
take part in only one or two phases. Pa rtial 
group decision making is common in organi- 
zational practice. A leader may submit à 
fixed set of alternatives to group evaluation: 
he may seek generation of alternatives 
through “suggestion box" programs; or he 
may involve subordinates in only generation 
and evaluation, as implied in the following 
statement: 


I would not think of making a decision by going 


around the table and then deciding on the of 
how everyone felt. Of course, I like to hear € эпе, 
but then I go off alone and decide. The decisions 


that are important must be made alone [R. M. 
Nixon, as quoted by Schecter, 1972, pp. 18-191. 


Vroom (1973) has incorporated a similar 
decision-phase concept in developing a mode 
of organizational decision making. He pre 
sented five decision styles for group issues: 
ranging from “You solve problem or make 
decision yourself using information available 

.." to “You share problem with your sub- 
ordinates as a group. Together you generate 
and evaluate alternatives and attempt 0? 
reach agreement (consensus) on a solution 
[italics added |." Intermediate styles involve 
the leader's gathering information from sub- 
ordinates individually, or allowing subordi- 
nates, individually or as a group, to generate 
(and perhaps evaluate) alternatives. 


1 
| 
| 
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3 With this framework, one can examine the 
effects of subordinates’ participation in deci- 
sion pha on the development of power 
| relationships. Laboratory research has shown 
| [ек eau qase sroup decision making 
| Ч ates higher levels of subordinate influ- 
ence than does partial group decision making. 

x This effect also holds for the group influence 
pie; subordinates in complete and partial 


| 
| Broup decision making conditions reported 
| ТОНЕ levels of leader influence (Wood, 1970, 
| 972а). Influence also depends on the par- 
| E phase in which members participate. 
p members who participated only in 
generating alternatives perceived their influ- 
епсе as slightly higher than did members who 
6 Only evaluated alternatives; members in- 
га опу in final alternative choice de- 
Scribed their influence as significantly greater 
than did participants in either of the other 
phas 
Multiple power inputs to group decision 
making, Participants in group decision mak- 
ing may exercise influence derived from 
multiple bases of power. Social power, or 
One's capability of influencing (i.e., affecting 
the behavior of) another (Cartwright, 1965), 
develops from several sources. These multiple 
a ei different rationales for one’s 
nis as with the influence attempts of m 
in PS Veber ( 1961) argued that power vested 
reaucratic positions Was of three “types 
legal authority legitimized by rational law 
ы 115 formal application, traditional author- 
к derived from the sacredness of the social 
kis and charismatic authority based 2 
ties, vers’ devotion to leaders’ heroic qua 


distinction 
and “au- 
(1960) 
and 


bg nd (1938) introduced the 
veen “authority of position 
rept eco leadership," which Bass 1 
дон in terms of "position powel La 

Sonal power." Interpersonal exchange 
mg distinguishes fate power and Lines 
Tefers (Thibaut & Kelley, 1959). Fate ро 
оп th Y the automatic effect of опе 5 be ed 
— € interaction outcome for another per 


ty g н о ааа 

A. К. Cooper, Satisfaction, perceived influere 
"ecision. riskiness: The effects of parium 
RU Cpr sia ч = itt Е 
lisheg Mmitment in decision-making TOU: Ed 
1955 Hectora] dissertation, Ohio State University. 


son; in exercising behavior power, one varies 
his behavior to make it desirable for the other 
person to change his behavior. Etzioni (1965) 
has Suggested classifying organizations ac- 
cording to their dominant power base. The 
type of organizational commitment will de- 
pend upon whether the primary power is (a) 
coercive, based on physical sanctions; (0) 
utilitarian, based on material rewards; or 
(c) identitive, based on identification with 
reference group norms. 

French and Raven's (1959) description of 
five power bases is probably the most widely 
accepted version of multiple power bases. 
Legitimate power derives from authority as- 
signed to positions. Legitimate power may be 
greater in positions of centrality (Mechanic, 
1962). Power may be based on expertise 
gained through experience and training. А 
leader may exercise referent power if follow- 
ers consider him a member of their primary 
reference group. Hollander and Webb (1955) 
and Hurwitz, Zander, and Hymovitch (1968) 
have substantiated an association between in- 
terpersonal attraction and the exercise of 
power. Referent power is more likely, however, 
to be a source of subordinate influence exer- 
ed through coalitions or the informal orga- 
Mechanic, 1962). Coercive power, 


based on the application of physical force, is 
power, though it 


another possible source of 

rarely occurs in modern organizations. Con- 
trol of rewards provides a common power base 
through which workers’ needs may be satis- 
fied, This source of power may derive from 
task-environmental rewards or rewards asso- 
ciated with successful interaction (Collins & 
Guetzkow, 1964). These multiple power bases 
may interact in determining one’s influence 
т another person or the products of group 


cis 
nization ( 


over 
decision making. 
Multiple power 
organizational consequences. To 
that managers were reported to rely on refer- 
ent and expert power bases, their subordi- 
nates’ satisfaction and performance were high. 
Satisfaction was inversely related to reward, 
coercive, and legitimate power. Reward power 
also correlated negatively with performance 
(Bachman, smith, & Slesinger, 1966). Stud- 
ies of salesmen, college faculties, life insur- 
and production workers repli- 


bases relate differently to 
the extent 


ance agents, 
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cated the relationships of referent and expert 
power to satisfaction (Bachman, Bowers, & 
Marcus, 1968). However, reward power cor- 
related positively with satisfaction and per- 
formance in insurance agencies. Among pro- 
duction workers, satisfaction was directly 
related to legitimate power, but performance 
was positively related to referent power. 

Further work is needed on the effects of 
power bases and their development. Effects of 
.group decision making on power sharing and 
organizational effectiveness may depend on 
the interaction of multiple power bases across 
individuals and hierarchical group levels. Re- 

ch could be designed to test the hypothe- 
sis that the effects of power bases should in- 
teract with phases of the group decision-mak- 
ing process. It appears reasonable to predict 
that expertise would be beneficial to generat- 
ing high-quality decision alternatives, that 
successful group evaluation would rely more 
heavily on referent power, and that choice 
behavior would depend on reward and legiti- 
mate power. 

Interaction in the group process. The pres- 
ent model also considers the extensiveness and 
qualitative nature of interpersonal interac- 
tion as important determinants of group deci- 
sion-making effects. Distinctions between deci- 
sion styles imply different degrees of leader— 
subordinate interaction. Individual leader 
decision making and delegation should yield 
different power relationships though neither 
approach involves extensive interaction. In- 
teraction in group decision making may, how- 
ever, affect differences in intersubordinate, as 
well as leader-subordinate power relations. 
Interaction should also be critical in the de- 
velopment of the additional intervening proc- 
esses of interpersonal affiliation and trust, 
which in turn should relate to organizational 
consequences. 

Research on choice shifts in decision be- 
havior indicates that the degree of interaction 
also affects the magnitude of choice shifts. 
Group decision making generally produces de- 
cisions of greater risk than does individual 
decision making. Several interaction-based 
mechanisms, such as diffusion of responsibility 
and social comparison of values, have been 
offered as explanations for this phenomenon 
(Pruitt, 1971). 
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Evidence also suggests that the utility 
of interpersonal interaction depends on its 
application to different phases of the decision 
process. Vroom, Grant, and Cotton (1969) 
found interaction to be functional in evaluat- 
ing decision alternatives, but not in generat- 
ing them. Qualitative interaction differences 
should also affect power relationships, since 
group members perceive different behaviors 
exhibited in group decision making to reflect 
different degrees of influence (Wood, 1970). 


Situational and Individual Differences 


The second basic focus of the situational- 
process approach is on the role of situational 
and individual variables. These classes of 
variables should moderate both the effects 
of group decision making on intervening proc- 
esses, such as power and affiliative relation- 
Ships, and the relationships between these 
processes and organizational outcomes, such 
as attitudes апа performance. 

Group decision-making theorists have rec- 
ognized the general role of situational con- 
straints, McGregor (1960) introduced the 
concept of selective adaptation wherein ap- 
propriate control mechanisms are adapted to 
the situation.’ He suggested that group deci- 
sion-making effectiveness will depend on man- 
agers’ skills, the organizational level where 
decision making occurs, and the nature of the 
decision issues (Ch. 9), Maier (1955) has 
similarly indicated that the success of group 
decision making will be bounded by an “area 
of freedom," defined in part by the nature of 
the task and the discussion skills of the group 
leader. Argyris noted individual differences in 
motivation to exercise power (1957) and, 
more recently, has defined some organizationa 
conditions requiring different decision-making 
structures and processes (1971). Blake and 
Mouton (1961) advocate their collaborative 
power relationship when interaction oppor 
tunities exist. Subordinates should exercise 


d + Я А = much 
+ Situational considerations were evident 


earlier in the work of Follett (Metcalf & Has 
1942), who attempted to incorporate employee тер чё 
sentation and decentralization in classical xdi. 
theory, Her “law of the situation" implied ke 
leaders must diagnose situations and elicit bet 
compatible with their demands in order to effective? 
exercise power. 
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Sreater power if they are exceptionally 
skilled: greater leader power is necessary in 
limes of crisis or when subordinates lack 
skills and information, 

Д Тһеогу and research should identify and 
integrate critical mediators, such as those 
variables associated with the organizational 
environment, decision tasks, and motives of 
participants, 

Organizational characteristics. There has 
been little research on effects of specific 
Situational variables in the organizational con- 
text on power relationships. However, re- 
Search and theory on broad organizational 
Parameters suggest that characteristics such 
as the general mission of the organization 
(Sloan, 1964), interdependence of functional 
Subsystems (Marschak, 1959; Porter & Law- 
ler, 1965), technology and programming of 
tasks (Woodward, 1965), and managerial 
climate (McGregor, 1960) may affect the util- 
ity of power sharing. Through their effects on 
Organizational structures and processes, these 
Parameters may generate extant balances of 
Power that preclude power sharing. Mulder 
and Wilke (1970) assume that group decision 
Making will not lead to power sharing, given 
ап expert power relationship favoring the 
eader, Rather, the leader-subordinate power 
Бар will widen unless the subordinates de- 
velop expertise by interacting with an expert 
leader, Mulder and Wilke argue that effective 
Sroup decision making requires participant 
“quality on abilities and motivational dimen- 
Sions, The exercise of power in decision mak- 
115 may also enhance expectations of exercis- 
‘NS power in the future and thereby may limit 
Organizational] flexibility in decision-making 
N proaches (Mulder, 1959). Shiflett and 
es (1972) manipulated the ri Div 
Dower of the leaders of task groups. se ^ 
тоге үө augmented, the kade aE How- 
ever niga directive behaviors. = nd 
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process; (b) little useful information is 
available at lower levels; and (c) time pres- 
sures on decision making are severe (Lowin, 
1968). Leavitt (1965) has also noted that 
organizational size and complexity may im- 
pose constraints on the feasibility of group 
decision making. 

Heller and Yukl (1969) surveyed the ex- 
tent to which managers operating under dií- 
ferent organizational conditions used the fol- 
lowing decision-making approaches: (a) own 
decision without explanation to subordinates, 
(5) own decision with explanation, (c) con- 
sultation, (d) joint decision making, (e) 
delegation. Decision styles varied by organiza- 
tional level and function. Student leaders 
most frequently used d above, senior business 
managers emphasized e, second-level super- 
visors used c and б, and b was the most com- 
mon method for first-line supervisors. Person- 
nel and general managers relied more on 
group decision making than did production 
and finance managers. Greater spans of con- 
trol were associated with Method b for sec- 
ond-level supervisors, and with either a or € 
in top management. Job experience was re- 
lated to e at upper levels and a at lower levels. 
Lower level managers more often applied 
group decision making to task than to group 
maintenance types of decisions. These data, 
while addressed to group decision-making 
utilization rather than outcomes, indicate a 
major role for situational moderating varia- 
bles. 
Decision task. A more general underlying 
situational concept may be associated with 
decision tasks. Wood (1972a) distinguished 
decision tasks as being facilitative or con- 
trastive, based on Deutsch's (1949) consid- 
eration of promotively interdependent and 
contriently interdependent goal structures, 
Deutsch’s promotive case implied cooperation 
in that an individual cannot attain his goal 
unless other group members attain theirs. 
A contrient goal structure, where the indi- 
vidual can reach his goal only if other mem- 
bers fail to reach theirs, generates intragroup 
Wood’s facilitation—contrast was 
the promotion-contrience 
as based on differing 
decision methods to 
es. Contrast could 


competition. 
less stringent than 
definition. Contrast W 
member preferences for 
be applied to particular issu 
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be extended to incorporate differing prefer- 
ences for the nature of the resulting decision. 
Facilitation referred to the case where, by vir- 
tue of compatible reference-group norms for 
shared power in resolving the issue, or con- 
vergence of outcome preferences, the task 
facilitated the mutual exercise oí power by 
superiors and subordinates. These íormula- 
tions imply that facilitation-contrast depends 
both on the specific issue under consideration 
and the decision-making participants. А con- 


trastive situation is similar to Vidmar's 
(1967) conceptualization oí a negotiation 


group, where members belong to reference 
groups with conflicting views, and these role 
forces compete with the force toward generat- 
ing acceptable, constructive decisions. Lowin 
(1968) has suggested that expansion of the 
influence pie is more probable when leaders 
and subordinates are not contriently related, 
Wood's data supported the hypothesis that 
power sharing should be more evident with 
facilitative than with contrastive tasks, Sub- 
ordinates in male decision groups reported 
greater subordinate and leader influence in 
facilitative task situations. Subordinates’ as- 
sessments also indicated that “power equaliza- 
tion” (no significant difference between 
leader and subordinate influence) occurred 
only when subordinates participated in the 
complete (generation-evaluation-choice) deci- 
sion process of resolving facilitative task is- 
sues. Further work on the determinants of 
facilitation. should examine task properties 
generating facilitative relationships, individual 
characteristics associated with goal structures, 
and attitudes toward decision-making roles. 
The work of Vroom (1973) should increase 
our understanding of the situational facilita- 
tion of group decision-making approaches. He 
has examined the use of variations of decision- 
making strategies in situations involving dif- 


ferent “problem attributes," One attribute 
refers to the likelihood of conflict among 
subordinates in preferred solutions. This 


corresponds closely to Wood's preliminary 
definition of task facilitation-contrast. In ad- 
dition, however, Vroom has incorporated in 
his model other potentially critical attributes 
of the problem, including intermember trust 
to base solutions on organizational considera- 
tions, the importance and likelihood of ac- 
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ceptance and decision implementation, the im- 
portance of decision quality, and the impor- 
tance and accessibility of information held by 
various members. Normative rules specify the 
appropriateness of decision styles given the 
problem conditions. These rules generate a 
set of feasible approaches that protect both 
quality and acceptance criteria. One can then 
select an approach from the set of possible 
styles on the basis of minimizing man-hour 
investments in the decision process. Data 
from a cross-cultural sample of managers 
indicated that decision makers frequently vio- 
late the proposed decision rules, particularly 
those protecting acceptance. The further re- 
sults of Vroom's work, including tests of the 
fit between his normative model and descrip- 
tive models, and the training of managers to 
diagnose problem attributes, will deserve the 
attention of group decision-making theorists. 

The foregoing research implies that for 
participative management to be effective, 
either the group approach must be limited to 
subsets of situations, or long-term involve- 
ment in such a system should reduce the 
contrast of decision tasks through the devel- 
opment of mutual trust and openness. Evi- 
dence from a field experiment with 10-14- 
year-old boys in athletic training showed 
greater effects of participation in later trials 
(Veen, 1972). However, intervening processes 
operating over time were not cleanly isolated. 
Longitudinal studies are especially critical ue 
light of recent evidence that group powe! 
structures interact with task difficulty and the 
time of criterion measurement in their effects 
on group performance (Levine & W wt 
1971). Groups with both balanced contro 
structures and high levels of total group сой. 
trol performed best in the later stages of M 
problem-solving tasks. However, while Le 
anced control groups were also superior - 
unbalanced groups in the late part of ш) 
tasks, the differentiation of amounts of con 
trol had its effect in the early trials, wes 
high-control groups outperformed low-contr?. 
groups. Moreover, in addition to ie 
(1968) prescription that group decision (45% 


е 38 < the 

be directly relevant to the раар, 
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work of Levine and Weitz indicates i 
more 


differences in power distribution become 
important on difficult tasks. 
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Individual motives. Another class of varia- 
bles that may account for variance in the 
effectiveness of group decision making in- 
Volves participants" motivational differences. 
Individual differences in motivation seem most 
directly relevant to attitudinal effects, which 
Will depend on motive structures and the 
Opportunity provided for motive attainment 
by group decision making. Previous research 
Supports this general proposition. Vroom 
(1960) measured individual needs for inde- 
pendence and authoritarianism, perception of 
Participation, overall job satisfaction, and job 
Performance among managers in a national 
delivery company. Significant correlations be- 
tween participation and attitudes were ob- 
tained only for subsets of the sample with 
Strong and moderate independence needs, and 
moderate and weak authoritarianism needs. 
Similar relationships occurred on the per- 
Ormance criterion, although between-subset 
Correlation differences were not significant. 

Differences attributed to specific motives 
May also be reflected in broader cultural val- 
Чез. French, Israel, and Аз (1960) attempted 
to replicate the Harwood experiment on re- 
Sistance to change (Coch & French, 1948) in 
a Norwegian factory. The expected beneficial 
Tesults of group decision making did not occur, 
Presumably, in part, because of the incom- 
Patibility of cultural norms with lower-level 
Workers? acceptance of decision-making re- 
SPonsibility, Hulin and Blood (1968) have 
Introduced the concept of "alienation from 
middle-class norms" to explain urban, blue- 
Collar workers’ negative reactions to efforts 
to enrich their jobs. Workers who are alien- 
Aled from middle-class, Protestant-ethic work 
Values, as indexed by such community varl- 
ates as standard of living and population 
density, tend not to be satisfied with job en- 
"степу programs, Inasmuch as such pro- 
Brams involve extending responsibility to sub- 
ordinates, these generalizations may apply to 
dieis forms of power sharing, such as ErOUP 
„оп making, Etzioni (1965) has SUBr 
ана this extension, in stating: "It is in tace 
ae where alienation is not воа 
соту; . that organizational ейог 5 ir 
„mitment and to control expressive activ! 
€S through such mechanisms as . . . partici- 


pation . . . in decision making are successful 
| pp. 668-669 |.” 

While these approaches suggest that we can 
determine subsets of the population which 
will respond favorably to power sharing, it is 
also possible that power-sharing programs will 
generate attitude or behavioral change within 
a heterogeneous group based on multiple 
motivational mechanisms, The strength of 
participants’ power motivation or related ego 
motives should predict responses to power 
sharing. Lowin (1968) notes that economic 
motives can also be met if participation is 
associated with interdependence of financial 
rewards. One might therefore expect favorable 
responses to group incentive systems, particu- 
larly those which are participatively devel- 
oped. Yet, when group participation and indi- 
vidual reward are combined in a system, it is 
of interest to examine the contributions of 
each component to group effectiveness. Re- 
cent work underlines the importance of the 
power-sharing element. Among  building- 
maintenance workers, a participatively devel- 
oped incentive system significantly improved 
job attendance rates, while the same system 
(equivalent amount of and basis for reward) 
imposed by management failed to elicit be- 
havior changes (Lawler & Hackman, 1969; 
Scheflen, Lawler, & Hackman, 1971). 

Group processes designed principally to re- 
distribute power may also satisfy social mo- 
tives. Affiliation can accrue from group deci- 
sion making, particularly when members in- 
teract over time and under conditions which 
facilitate agreement and joint goal-seeking 
activities. Wood ^ (1970, 1972b) has shown 
that both power and affiliation motives 
moderate attitude outcomes of group decision 
making. Influence perceptions were related to 
satisfaction only for group members with 
strong and moderate power motives. For in- 
dividuals with strong affiliation motives, per- 
ceived participation (involvement in the 
group process) was significantly related to 
satisfaction, even though these attitude mea- 
sures were not related to the exercise of influ- 
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ence. The mediating effects of social motives 
also interact with the decision process and 
task parameters previously discussed. Facili- 
tative decision tasks were more conducive to 
satisfaction than contrastive tasks for people 
with strong affiliation motives, Moreover, par- 
ticipation in the choice phase of the decision 
process was related to satisfaction for group 
members with weak, but not those with 
strong, affiliation motives (Wood, 1970). 
Characteristics of situations, decision proc- 
esses, and the people involved in the process 
must be considered as interacting determi- 
nants of power distribution and its effects. 


SUMMARY AND GUIDELINES FOR FUTURE 
RESEARCH 


This review of power sharing and group 
decision making implies a need to conceptual- 
ize organizational power relationships in terms 
of a model which will account for the psycho- 
logical mechanisms underlying shifts in power 
distributions and which defines the boundary 
conditions for the useful implementation of 
power-sharing programs in organizational de- 
velopment. A basic implication of the prelimi- 
nary situational-process model is that a viable 
theory of power in organizations must deal 
with differences in situations and individuals 
who perform organizational roles. This ap- 
proach is consistent with previous models of 
leadership processes, Characteristics of organi- 
zational environments, decision tasks, and in- 
dividual motives will affect both perceived 
distributions of power resulting from organi- 
zational change programs and individuals’ 
affective responses to power sharing. Further 
research is needed to isolate the specific 
variables within these broad categories of 
boundary conditions which contribute to vari- 
ance in power distribution and behavior, to 
systematically test their interactions, and to 
develop basic theoretical dimensions of en- 
vironments, tasks, and people. 

A related implication involves the concep- 
tualization of power as an intervening process 
in organizational development, rather than as 
a structural given or terminal effect of planned 
change. Power relationships are viewed as 
deriving from multiple inputs, in terms of 
decision-making strategies and other organi- 
zational design variables, and as leading to 
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multiple outputs in terms of organizational 
consequences such as member attitudes and 
behaviors. The situational elements of the 
model may differ in their effects of mediating 
input-power and power-output relationships. 
Structural and task mediators may be most 
important in the former linkage, and motiva- 
tional mediators in the latter. Further, specific 
motivational mediators may affect both link- 
ages. Power motivation affects individual in- 
fluence attempts," as well as attitudes result- 
ing from the successful exercise of influence. 
The model also incorporates other intervening 
processes associated with group decision mak- 
ing. Affiliative processes may relate to social 
motive fulfillment (Wood, 1972b). Interper- 
sonal trust should enhance commitment and 
decision implementation (Lawler & Hackman, 
1969). 

Examination of these intervening processes 
is crucial if we are to understand the dynam- 
ics of organizational change. Many change 
programs involving role design and personnel 
development are intended to directly affect 
power relationships. On the other hand, pro- 
grams may be designed to improve role atti- 
tudes and behaviors but fail to pay explicit 
attention to intended or serendipitous changes 
in intervening processes. Job enrichment and 
human relations training are often adopted as 
treatment packages to generate change. How- 
ever, the focus is generally on the relation be- 
tween the treatment and the organizational 
consequence rather than on intervening mech- 
anisms which might yield changes in the con- 
sequence. Moreover, programs designed to 
affect organizational behavior through changes 
in power relationships may generate changes 
in behavioral outcomes, but not via the power 
mechanism. In a reported 2-year program 
incorporating group decision making, earnings 
development, and training, performance im- 
proved 30%, but there were no significant 
changes in power relationships (Marrow, Bow- 
ers, & Seashore, 1967). А 

The present approach also suggests exam” 
nation of effects of power relationships ОП 
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multiple organizational consequences. Criteria 
for group decision-making effects have varied, 
and several studies have shown different treat- 
ment effects on different criteria (Marrow 
et al, 1967; Morse & Reimer, 1956). It 
тау be helpful to distinguish classes of 
Consequences based on immediacy of out- 
Comes, "The more proximal class includes char- 
acteristics of decisions, particularly their qual- 
Цу, riskiness, and acceptance by organiza- 
tional members, Group decision making pur- 
Portedly produces an optimal balance between 
decision quality and acceptance. Maier (1955) 
has noted that acceptance is the major benefit 
Of group decision making. Further, to the 
extent that participants demonstrate task ex- 
pertise, decisions should be of high quality. 
Reported. data support the acceptance hy- 
bothesis in terms of compliance and the exe- 
cution of decisions. The quality of group de- 
cisions, however, has not been clearly docu- 
mented, and is difficult to evaluate without 
quality assessments independent of the group 
Participants. 

The more distal set of consequences of 
Stroup decision making and power sharing in- 
cludes organizational role performance and 
attitudes, and other measures of system effec- 
tiveness, Predictions of attitudes follow from 
Psychological theory underlying power shar- 
ing if we formulate hypotheses based on mo- 
tive fulfillment as an attitude determinant 
and consider the multiple motivational mecha- 
nisms operating in group decision-making 
Strategies, Relationships of group decision 
making and power sharing to role performance 
Should depend on additional organizational 
Parameters, such as decision-reward contin- 
Sencies and individual expectancies (Lawler, 

971: Vroom, 1964, 1970). Contributions of 
Power-sharing approaches to system effective- 
ness should depend on the longevity and per- 

3slveness of the approaches and the feedback 

their consequences to future decision pro- 
“dures and organizational development pro- 
Brams. 

Y Future research efforts can be directed to- 
mur conceptual development of n нч ез 
S of the situational-process model, 

ective manipulation or measurement, and 

CIr causal linkages. Effective organizational 

Sign rests upon an understanding of indi- 
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vidual and situational mediators of effects of 
decision processes and power relationships. 
The validity and complexity of the model will 
influence the utility of situational change or 
adaptive treatment strategies. Adapting deci- 
sion methods to fit individual-organizational 
contexts assumes flexibility in organizational 
roles and effective means of assessing the con- 
text variables. On the other hand, the con- 
sistent use of power-sharing methods may re- 
quire extensive change in organizational struc- 
ture, decision problems, and individual atti- 
tudes. The resolution of the theoretical and 
practical issues concerning organizational 
power relationships will come only from sys- 
tematic laboratory research designed to iden- 
tify and link causal relationships, and from 
organizational applications designed to dis- 
cover critical interactive variables and to 
evaluate the comparative utility of different 
decision-making approaches and power dis- 
tributions. 
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A review oi the literature on implicit personality theory reveals that there 
has been considerable concentration on various techniques for measuring the 
theory. While such measures do have considerable overlap they also vary in a 


variety of ways. The major theoretical questions have involved personality 
correlates of individual implicit personality theory, relationship of the theo 


to linguistic structure, and whether it reflects the actual distribution of traits 
in other people. Questions are raised about whether traits are the most 
appropriate units of person cognition and whether perceivers see traits as 
distributed across situations as well as stimulus persons. 


In the first comprehensive review of re- 
search literature on person perception, Bruner 
and Tagiuri (1954) introduced the term 
"naive, implicit personality theory" to de- 
scribe the possibility that perceivers assume 
inferential relationships among attributes of 
people. While the term was a new one, the 
idea was relatively old. Furthermore, quite 
similar ideas were developed about the same 
time by Jones (1954), Steiner (1954), Kelly 
(1955), and Cronbach (1955). It is the pur- 
pose of this paper to explore the varieties of 
ways the term has been used, to review the 
issues which have been raised in experimental 
studies, and to suggest a broadened notion of 
implicit personality theory. 

Two traditions are represented in the no- 
tion of implicit personality theory. The first 
concerns the role of general bias in judgments 
of others, and the second has to do with indi- 
vidual differences in person perception. Vari- 
ous people who have examined trait ratings 
have noticed that perceivers tend to exagger- 


ate the size of relationships among traits 
(Newcomb, 1931; Thorndike, 1920). This 
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tradition has been represented on the one 
hand by those who see person perception an 
particularly impression formation as very 
much like concept formation or as similar to 4 
wide variety of judgmental phenomena (e£ 
Bruner, Shapiro, & Tagiuri, 1958; Wishner, 
1960) and by those who have been primarily 
concerned with an analysis of linguistic fac- 
tors in judgment (e.g., D'Andrade, 1965; Os- 
good, 1962). Thus in attempting to explain 
the pervasiveness of perceived trait covarla- 
tion, researchers have looked to a presumably 
more basic psychology of human judgment OF 
to biases implicit in our language. 

A second major tradition has been more 
concerned with individual differences among 
perceivers. Research on authoritarianism, “the 
New Look,” personality measurement, and 
personal construct theory all contribute tO 
this interest, 

The term “implicit personality theory” WAS 
initially used by Bruner and Tagiuri to de- 
scribe assumed relationships of traits. Clearly: 
however, the term could be used to corre 
spond to the broad conception of classic ре!” 
sonality theory, namely, a set of assumptions 
about why people behave the way they do. 
Cronbach (1955) attempted to view implicit 
personality theory as more than relationships 
among traits; specifically he suggested that a 
person’s “lay theory of personality” consiste 
of the mean and variance of each trait 45 
well as the covariance among traits. Except 
for Gross (1961) there have been almost пе 
explicit studies of the first two properties 0 
trait ratings. 
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Classic personality theories devoted consid- 
erable attention not only to the selection of 
basic terms at two or more levels of generality 
Or abstraction, but also to determination of 
dynamic and causal relations among them. 
That does not, of course, mean that naive 
theories of personality differ in content and 
dynamic properties, but the possibility is 
worth attention. A later section of the paper 

ill review the limited research on these as- 
pects of implicit personality theory. The first 
Sections, however, deal with research on the 
Perceived relationships among traits. 


MEASUREMENT Or TRAIT RELATIONSHIPS 
General. Procedures 

While most implicit personality theory re- 
Searchers have at the least obtained a matrix 
of trait relationships, there are considerable 
differences in how these relationships are gen- 
erated and what further analyses are per- 
lormed on this trait matrix. We may consider 
the following several points where differences 
arise: (a) assignment of traits to objects, (5) 
Creation of the trait matrix, (c) interpreta- 
tion of trait relationships, (d) reduction of 
the trait matrix to some more basic matrix, 
(e) interpretation of this basic matrix. 
Assignment of traits. Some researchers be- 
gin with several perceivers assigning traits to 
Stimulus objects. Most often, perceivers are 
asked to assign to each trait a magnitude for 
each stimulus object, for example, how much 
9f a particular trait each stimulus object pos- 
Sesses. The stimulus objects are typically 
People and the final result is a Perceiver X 
Tait X Stimulus Person matrix. . 

Generally, the perceiver dimension is ig- 
nored either by considering the Trait X Ob- 
Ject matrix separately for each perceiver, or 
Y creating a composite Trait x Object 
Matrix by summing or averaging across per- 
Eve Levy and Dugan (1960) have, how- 
Hint created a Composite Trait X Perceiver 
ti rix where each cell represents how many 
ed each perceiver had assigned each trait 
? all stimulus objects. 
y i pout eia, WM timulus 

having jects ign traits to 50 s 

Objects pes so а үгайз тау һе cal- 
ES ated acr 5 т 5 ts or across per- 
Cej oss stimulus О jects Е 

Wers from the Trait x Stimulus Person or 


en researchers begin 


Trait X Perceiver matrix, It is also possible 
to calculate profile distances among traits 
(Cronbach & Gleser, 1953) instead of corre- 
lations. In some instances this is more appro- 
priate (Rosenberg & Olshan, 1970) since cor- 
relations are sensitive only to similarities of 
profiles between traits across stimulus objects 
and not to mean differences among traits. 

Most researchers generate the trait rela- 
tionship matrix directly, In some way, sub- 
jects are asked to assess rather directly the 
relationships among traits. There are a num- 
ber of strategies for getting these data. 

Some studies have relied on ratings of simi- 
larity (Messick & Kogan, 1966; Pedersen, 
1965). The subject is presented with a pair 
of traits and asked to rate how similar or dif- 
ferent the traits are. 

Researchers have used likelihood of co- 
occurrence methods (e.g., Bruner, Shapiro, & 
Tagiuri, 1958; Hays, 1958; Lay & Jackson, E 
1969; Warr & Sims, 1965). Here the tech- 
nique is, Given that a person has Trait X, 
how likely is it that he has Trait Y? 

Hinkle? (also described in Bannister & 
Mair, 1968) obtained perceived causal judg- 
ments from subjects. Each subject was pre- 
sented with a construct and asked if a person 
were to change in that construct (e.g., change 
{тот intelligent to stupid), on which others 
of the 19 constructs used would he also 
change. The result is а matrix of perceived 
cause-effect inference. 

A fourth method used by Rosenberg (Ros- 
enberg, Nelson, & Vivekananthan, 1968; Ros- 
enberg & Olshan, 1970) obtains measures of 
co-occurrence. Each subject divides a total 
sample of traits into piles, each pile repre- 
ag a distinct person. Each trait can be 
used only once although Rosenberg reports 
the findings are not altered greatly if traits 
sed several times. The co-occurrences are 


sentir 


are u a 
converted into disassociation scores which 
basically depend on the number of subjects 


who disagree on whether two traits go to- 
t should be pointed out that one 
dv co-occurrences in various 
les or in any feature which 


gether. I 
could also stu 
situations or roles 
Сару N. Hinkle. The ch 
{тот the viewpoint of a the 
published doctoral dissertation, 
sity, 1965. 


ange of personal constructs 
ory of implications. Un- 
Ohio State Univer- 


296 DAVID J. SCHNEIDER 


serves to distribute traits ( Rosenberg & Jones, 
1972). 

Reduction and interpretation of the trait 
matrix. Any of the correlational, distance, or 
strength-of-inference measures can be sum- 
marized through a wide variety of analytic 
techniques. Factor analysis is often used to 
reduce a matrix of trait intercorrelations Any 
of the above measures which are interpreted 
as distances lead to a variety of multidimen- 
sional scaling analyses, Rosenberg and Sedlak 
(1972) has an excellent summary of the as- 
sumptions underlying these models, 

Both factor analysis and multidimensional 
scaling lead to interpretations in terms of di- 
mensions underlying the trait matrix. Ordi- 
narily, of Course, interpretation of these di- 
mensions is performed by examination of 
traits which load at the extremes of the dimen- 
sions. More Satisfactory analytic techniques 
have been worked out and used in recent 
years. Rosenberg et al. (1968) have advo- 
cated a regression technique in which some 
extraneous measure (such as a measure of so- 
cial desirability) is used as a dependent mea- 
sure and projections on the various dimen- 
sions become the independent measures. In- 
terpretation of the multidimensional space 
then becomes possible through the pattern of 


multiple with these dependent 
measur 


correlations 


Distance data may be analyzed in Ways 
which do not lead to dimensional summaries. 
For example, one reasonable alternative is а 
hierarchical Organization, Cluster analysis 
(Johnson, 1967) has been used to provide 
Such а summary of trait relationship data 
(Rosenberg & Jones, 1972), Interpretation of 
Clusters can be done through analysis of vari- 
ance; presumably those external traits which 
result in high within-trait homogeneity while 
distinguishing between clusters, that is, with 
high between-cluster relative to within-cluster 
variance, can be used to interpret. clusters, 


Individual Difference Me 


surement 
Individual differences among perceivers can 
be studied using variants of Most of the tech- 
niques discussed. One possibility is to divide 
berceivers into Separate groups on some д 
Priori criterion and to study Whether the 
Broups differ in trait relationships, Another 


method is to determine what individual dif- 
ferences are present in the data itself, The 
simplest technique would be to factor-analyze 
among perceivers (the correlations based on 
the extent to which the perceivers agree in 
how much of the various traits they see across 
all stimulus persons). There are also multi- 
dimensional scaling models which allow de- 
termination of individual viewpoints from the 
co-occurrence or similarity data themselves 
(cf. Rosenberg & Sedlak, 1972). 

The more general point, however, is that 
when the researcher begins with a Perceiver 
X Stimulus Object x Trait data matrix, rela- 
tionships among entities on one dimension can 
always be obtained across entities on a second 
dimension for any subset of entities on à 
third dimension, So, for example, it would be 
possible (although not necessarily interesting) 
to isolate classes of stimulus objects from the 
data and then to obtain correlations between 
traits and across perceivers for different 
classes of stimulus objects, 


Methodological Problems 


Since the range of techniques used in im- 
plicit personality theory research is relatively 
broad, it would be instructive to raise some 
questions about them. There have, however; 
been relatively few studies which attempted 
direct comparisons of methods, 

Selection. oj traits. Although it has been 
rted that factorial Structure does not vary 
greatly as a function of stimulus objects and 
traits (provided a Teasonably heterogeneous 
sample of both is provided), there are reasons 
to doubt these conclusions, Most studies have 
provided subjects with both stimulus objects 
and traits, Correlations among traits differ a5 
a function of both stimulus person ( Koltuv; 
1962; Veness, 1969) and relevance of trait 


a 


5 at 
(Koltuv, 1962). One might suspect 
linguisitic factors would predominate in та 


ings of unfamiliar people using unfamiliar 
traits. 

А second concern with selection procedim 
also comes in part from Kelly (1955). ня 
Suggested that traits were functionally p 
mensional so that people used both a pae 
and some opposite, Personal construct Ше 
demands that each trait be paired with 1 
“opposite” to form a construct, but in prac 
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lice, the Opposite or contrast pole is rarely 
used in research (Bannister & Mair, 1968). 
In practice, of course, subjects may implicitly 
Use a dimension instead of a single trait in 
implicit personality theory studies. For ey- 
ample, when asked to judge how likely it is 
that a warm person is also unhappy, he may 
In fact judge how likely a cold person is u;- 
happy or a warm person happy, and simply 
e his ratings. Subjects may have diffi- 
in making direct ratings of negative 
co-occurrence. Presumably, we are more likely 
to code those traits which are likely to co- 
Occur with one another rather than those 
Which do not со-оссиг, but this is an empiri- 
cal question which deserves further attention. 
However, there are apparently real diffi- 
Culties with the use of opposites. The research 
of Green and Goldfried (1965) poses the 
Most important problem, They had subjects 
Tate various concepts on several scales. The 
results indicate that correlations among oppo- 
Site traits are not highly negative as one 
Would expect, For example, for the concept 
sin, good correlated —.39 with bad which 
While Negative was far from high. On the other 
hand, pleasant correlated .39 with unpleasant, 
at least when sin was being rated, These re- 
Sults then indicate that so-called semantic 
Opposites may typically not behave as oppo- 
Sites in application. 
_ Assignment oj traits. Quite obviously, be- 
inning Írom a Perceiver x Trait x Stimulus 
2ЈесЕ matrix, correlations among traits can 
>e derived either across perceivers or across 
Stimulus objects. We do not know whether 
these different ways of calculating correla- 
tions would lead to different results. Perhaps 
even more importantly traits may be dis- 
tributeq across situations rather than people 
mae 1968) and. may be perceived ie 
i a пе results of Endler and Hunt B 9 
SES that situational variance and variance 
| Situationa] interactions with other varia- 
ANS. йге important in self-perception, for ex- 
ample, 
Og OPI parison of measures of trait relation- 
ii the level of the matrix of pes rela- 
“UDS, the obvious question is the extent to 


w ud " 
ch different ways of estimating trait rela- 


tio hi 
r 
ds 1р$ Correspon d. 


odd and Rappoport (1964) have com- 


pared several methods. Subjects generated 
their own traits and then relationships among 
traits were obtained in three ways: (a) Sub- 
jects rated 25 stimulus persons on each trait; 
(5) subjects rated idealized concept persons, 
lor example, "intelligent person," on each 
trait; (c) subjects save co-occurrence ratings. 
The ratings of stimulus persons were con- 
verted into correlations among traits and the 
co-occurrence ratings were converted to impli- 
cating ratings using the Hays (1958) method, 
The average correlation for the 14 subjects 
between estimates provided by the two corre- 
lational techniques was .83 and between the 
two correlational and the co-cccurrence tech- 
niques, .79 and .73. Furthermore, in another 
part of the study, subjects were asked to 
pick (a) three other traits implied by various 
other traits and (5) what traits would allow 
them to infer a particular trait. These choices 
were predicted nicely by the correlational and 
co-occurrence similarities, 

The major advantages of techniques which 
result in correlations among traits are that 
the subjects perform a natural judgment task. 
They assign traits to stimulus objects. Direct 
techniques on the other hand ask the subject 
to abstract relationships from his own expe- 
rience, and of course, for any given perceiver 
his estimate of how much he sees two traits 
as related need not agree with the covariance 
actually present in his judgments, 

It might also be pointed out that correla- 
tional estimates can provide only symmetrical 
relationships among traits (e.g., intelligent 
implies witty as much as witty implies intelli- 
gent) while direct measures allow for asym- 
metricalities which do occur (Schneider +: 
Warr & Knapper, 1968). For example, sub- 
jects see witty people as more likely to be 
intelligent than the reverse (Schneider, see 
Footnote 4). 

Finally, the question of microscopic versus 
macroscopic analysis has not been sufficiently 
discussed in this literature. As has been 
pointed out, researchers aim for a trait rela- 

1D. J. Schneider, Extra-linguistic aspects of trait 
implications. In Paul F. Secord, Recent approaches 
to studying implicit personality theories. Symposium 
presented at the meeting of the American Psycho- 
logical Association, Washington, D. C, September 


1971. 
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tionship matrix which is nearly айу re- 
duced to a set of factors or dimensions. Yet, 
the research of Todd and Rappoport (1964) 
indicated that while individual trait relation- 
ships seemed closely related, factor analysis 
of the correlational data and a multidimen- 
sional scaling of the co-occurrence data gave 
imensions which showed poor correspond- 
ce. Furthermore, such dimensions did not 
correspond closely with dimensions subjects 
provided for themselves. There is suggestive 
ence from the research on the importance 
of the stimulus object to indicate that while 
the stimulus object is not an important de- 
terminant of factorial structure, individual 
trait relationships may be affected by what 
objects or persons are being rated. 

The evidence on these points is merely sug- 
gestive. However, if it is true that factors and 
dimensions are not congruent with how per- 
ceivers think they see the world and factors 
and dimensions are relatively insensitive to 
perceiver and stimulus sources of variance, 
there is reason to consider seriously what lev- 
els of analysis are most worthwhile and for 
what reasons. 


OTHER Ways or ASSESSING IMPLICIT 
PERSONALITY THEORY 
Stimulus Persons 

Several investigators have taken the stimu- 
lus person as the unit of investigation. Here 
the technique is to obtain judgments of simi- 
larity among a variety of stimulus persons 
and then through multidimensional scaling 
determine stimulus person dimensions. Jack- 
son, Messick, and Solly (1957) had subjects 
rate the similarity of one another (all subjects 
were friends of one another), Messick (1961) 
and Warr and Knapper (1968, Experiment 6) 
scaled similarity judgments of political fig- 
ures. Wiggins and Hoffman (1968) had sub- 
jects compare profiles representing stimulus 
persons. 

The interpretation of such dimensions can 
be done in various ways. When the stimulus 
person has known characteristics (such as 
with political figures), interpretation was 
relatively easy. Jackson et al. (1957) ob- 
tained personality data on the stimulus per- 
sons used in their study so that interpretation 
in terms of personality traits was possible. 
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Warr and Knapper (1968) showed that the 
loadings of stimulus persons on various dimen- 
sions were related to their ratings on various 
semantic differential scales. Any dimension (s) 
on which stimulus persons differ could be used 
to provide an interpretation of the space of 


stimulus persons. Wiggins and Hoffman 
(1968) varied the characteristics of their 


profiles systematically so that interpretation 
in terms of any cue dimension or combina- 
tion of cues was possible through analytic 
procedures, 

It should be pointed out that correlational 
methods could be adapted to the measure- 
ment of stimulus persons. From. a Perceiver X 
Stimulus Person X Trait matrix, correlations 
could be obtained among stimulus persons 
across perceivers or traits. 

Generally, it seems that scaling of stimulus 
persons has been an under-used technique. 
Among other things it allows determination 
of the dimensions subjects, or groups of sub- 
jects, actually use in the judgments of people. 
Since subjects may be unable or unwilling to 
verbalize their effective dimensions, person 
scaling may be one indirect way to assess this. 


The Situation 


Situations, roles, etc., can also be scaled 
using either similarity or correlational meth- 
ods. At present we have little or no empirical 
evidence about the ways situational forces are 
perceived by people. While a determination of 
dimensions along which situations, roles, ОГ 
other social structures differ is interesting 10 
its own right, it may become an increasingly 
important problem in person perception. At- 
tribution research has always recognized the 
role of the situation in perception of behavior 
(Hastorf, Schneider, & Polefka, 1970). To the 
extent perceivers see traits and behaviors as 
distributed across situations or roles, a dete 
mination of situation dimensions may be cru- 
cial. 

STRUCTURE 

Because of the interest of personality 
theorists in cognitive structure during the 
1950s and of the general Gestalt impact on 
person perception it seems natural that Dues 
of the early work in implicit personality 
theory was concerned with structure. 
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Centralit v 


А Although Asch (1946) did not use the term 
implicit personality theory, his research in 
impression formation has profoundly affected 
theory and methodology in this area. One of 
his concerns was that of trait centrality: This 
is discussed at some length by Hastorf et al. 
(1970). 

As is well known, Asch found that warm- 
cold was a central trait in several stimulus 
Contexts. Unfortunately, there was never any 
good evidence on when a trait would or would 
not be central or indeed why any other trait 
Should be central at all. 

Two separate traditions for considering 
these problems emerged. The first was based 
оп a correlational model and the second on a 
model of trait implication. Wishner (1960) 
Was the first to give a correlational basis for 
a solution, He was able to make a convincing 
Case that the response traits most affected by 
the shift from warm to cold were also the 
traits which correlated most highly with 
warm or cold, Thus any trait could become a 
central stimulus trait if it correlated highly 
with enough response traits. Warr and Knap- 
Per (1968, Experiments 10-13) have repli- 
cated Wisher’s results closely using narrative 
Material as stimuli. 

Most of the research which followed has 
been animated by a concern with linguistic 
biases in trait ratings (e.g., D’Andrade, 1965; 
Kuusinen, 1969a; Mulaik, 1964). It is pos- 
sible to argue (Jackson, 1962) that centrality 
is closely related to the loading of a trait on 
the most powerful factor analytic dimension 
Which is typically evaluative in nature. 

A second approach to the questions of 
method and theory about centrality came 
from the literature on scaling methods. For 

ays (1958), traits were central to the extent 
that they implied other traits without being 
implied by them, This judgment can be con- 
Verted to a measure of distance. Hays (1958) 
did find that warmcold defined the most 
Prominent dimension in his analysis, but since 
he used a limited number of traits, his dem- 
Onstration was merely suggestive. : 

More recently, а large and careful scaling 
of 60 traits (Rosenberg et al., 1968) revealed 
that warm and cold lie at opposite ends of 


one important dimension of social evaluation. 
These authors were also able to show that the 
traits most strongly affected by warm—cold in 
the initial Asch data loaded heavily on that 
same dimension. Perhaps just as important, 
they found that there is also a dimension of 
intellectual evaluation. These two factors also 
appeared in a scaling of slang words used to 
describe Princeton students (Friendly & 
Glucksberg, 1970). 

Rosenberg et al. (1968) argued that traits 
which are extreme on the two dimensions 
would be central for traits on their respective 
dimensions. Zanna and Hamilton (1972) have 
shown that varying warm and cold in a list of 
traits affects only traits which are either 
strongly socially desirable or undesirable 
while varying industrious and lazy affect re- 
sponses only for intellectually desirable or 
undesirable traits. These results confirm the 
Rosenberg et al. hypothesis and confirm the 
position that centrality may be viewed as a 
special case of general trait relationships. 

In one sense, however, the centrality issue 
remains an important one. We could ask 
whether some kinds of traits chosen on an a 
priori criterion convey more information and 
imply other traits more strongly than other 
kinds of traits. For example, do socially de- 
sirable traits imply other traits more strongly 
than negative traits? Hamilton and his col- 
leagues (Hamilton & Huffman, 1971; Hamil- 
ton & Zanna °) have shown that in judgments 
of likability and evaluation negative traits 
are weighted more heavily than positive traits. 
Clearly more research is needed on the differ- 
ential power of positive and negative traits. 

One might also expect that traits which are 
more general or abstract might have stronger 
implications than less general traits. Hinkle 
(see Footnote 3) had his subjects generate 10 
subordinate and 10 superordinate constructs, 
and he found that the latter were more often 
involved in causal implication than the sub- 
ordinate constructs. He also found that the 
superordinate constructs were more resistant 
to possible change. 


5D. L. Hamilton & M. P. Zanna. Differential 
weighting of favorable and unfavorable attributes in 
impression formation. Unpublished manuscript, Yale 


University, 1972. 
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Additive Models 


A second issue which came from the Asch 
paper was an emphasis on additive versus 
nonadditive processing of information. Basi- 
cally, he opposed a simple model of traits 
adding to produce a final impression against 
a nonpredictive, nonadditive model which 
claimed that the final impression would be 
different for each new set of stimuli. It has 
already been shown that correlational and 
multidimensional scaling models which are 
basically additive (although not in the simple 
way Asch had in mind) can predict final im- 
pressions, Furthermore, research from a vari- 
ety of contexts (Anderson, 1968; Rosenberg 
et al, 1968; Wiggins, Hoffman, & Taber, 
1969) show that linear (additive) models 
account for almost as much variance as con- 
figurational models. Present evidence (e.g., 
Goldberg. 1968; Wiggins & Hoffman, 1968) 
indicates that while some people do process 
configurationally, the configurational models 
are not dramatically more efficient or accu- 
rate. On the other hand, there is some evi- 
dence that certain variables may affect trait 
relationships. Veness (1969) found that oc- 
cupation affected trait correlations. For exam- 
ple, the correlation between warm-—cold, and 
unhappy-happy was —.36 for lecturers but 
.29 for road sweepers (the study was done in 
England), Over 25% of the correlations dif- 
fered significantly for different occupations. 
Further research is likely to be directed to 
conditions which favor one kind of processing 
over another (Wiggins & Hoffman, 1968) and 
more research should be directed to the ex- 
istence of moderator variables such as occu- 
pation, 


Cognitive Complexity 


A final issue in the structure of implicit 
personality theory concerns cognitive com- 
plexity. At one point it seemed likely that an 
important aspect of cognitive structure was 
complexity, Various ways of measuring com- 
plexity have been reviewed by Crockett 
(1965) and Schroder, Driver, and Streufert 
(1967), among others. Most often complexity 
has been related to impression formation 
processes, a literature reviewed by Crockett 
(1965) and Hastorf et al. (1970). 
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The main concentration has been on indi- 
vidual differences in complexity. Related di- 
rectly to implicit personality theory some 
researchers such as Pedersen (1965) and 
Walters and Jackson (1966) have found some 
individual differences in numbers of dimen- 
sions used for trait implications, but the 
differences are subtle and minor. Various re- 
searchers such as Steiner (1954) and Warr 
and Sims (1965) have found evidence that 
some people rely heavily on an evaluative 
dimension, thus possibly limiting the number 
of other dimensions used. 

There are other studies which are related to 
cognitive complexity although they are not 
generally thought of in this framework. One 
could ask what features of cognitive set, the 
situation facing the perceiver, or the kind of 
stimulus person being judged have on the 
complexity of implicit personality theory. For 
our purposes high relationships among traits 
can be taken as evidence of relatively low 
complexity, since high relationships probably 
reflect fewer dimensions or lower independence 
of dimensions. 

The research of Zajonc (1960), Cohen 
(1961), and others on cognitive tuning are 
well known. This research showed that infor- 
mation was related in a more complex fashion 
when subjects expected to receive rather than 
to communicate the information. More re- 
cently O'Neal has shown that cognitive set 
affects trait relationships. In a first study 
O'Neal and Mills (1969) argued that when 
perceivers make a choice among people they 
desire to be certain and that, therefore, they 
try to increase the general positive valence 9 
the chosen person. That should result 1 
higher correlations among positive traits for 
chosen as opposed to unchosen people, 8 
result obtained in the study. In a secon¢ 
study, O'Neal (1971) showed that this halo 
effect was also increased by caffeine arousal 
and by decision importance. In both studies 
only positive traits were used since the in- 
terest was in trait covariance due to social 
desirability. However, it may well be that 
desires to be certain about an unknown other 
result in decreased cognitive complexity and 
a use of fewer dimensions of judgment. When 
certainty is needed such as when the subject 
must make a choice or when he is in a сот" 
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munication tuning set, "illusions" of cer- 
tainty may be achieved through decreased 
complexity of trait relationships. Thus, trait 
relationships may be higher for all traits if the 
above (or other) conditions apply. 

There is a possibility that greater complex- 
ity (as evidenced by lower trait relationships) 
may exist for certain types of stimulus per- 
sons. Unpublished studies by Nidorf and by 
Supnick cited by Crockett (1965) showed that 
information seeking and descriptions are less 
complex for people who were different from 
the subject in several strata and who were 
disliked. Koltuv (1962) found that correla- 
tions were higher among traits for unknown 
than for known others, even when social de- 
Sirability variance was partialed out. These 
results are hardly surprising; they are con- 
Sistent with the position that complexity of 
implicit personality theory is related to 
amount of information. 

Generally, considerably more 
Ought to be addressed to the conditions which 
increase or decrease the complexity of im- 
Dlicit personality theory. Complexity may be 
à feature of situations and stimuli as well as 
perceivers, 


attention 


REALISM ISSUE 


One can always raise the question of where 
these implicit personality theories came from. 
If the theory reflects the actual distribution 
of traits among stimulus persons, then there 
is an adequate explanation for the theory. 
Clearly, then, general implicit personality 
theory would be merely a reflection of the 
general distribution while individual implicit 
Personality theories represent the effects of 
idiosyncratic experience, 

Tt is reasonably easy to show that some 
Perceived relationships among traits differ 
Significantly from what is the best guess of an 
empirical relationship. Thorndike’s (1920) 
halo effect was based on just this kind of ob- 
Servation, This does not, of course, indicate 
that most perceivers operate with habitual 
Implicit personality theories widely at vari- 
ance with fact, Not surprisingly, there has 
been considerable research addressed to this 
question, : 

The first explicit attempt to show that im- 
Plicit personality theory exists independent of 


subject experiences was by Levy and Dugan 
(1960). They used 15 traits and their primary 
measure was how often each was used to de- 
scribe photographs. There was a separate 
group of pictures for each trait and no picture 
was judged on more than one trait. Each per- 
ceiver then had a score for each trait reflect- 
ing the number of times it was used in rating, 
Thus they argue correlations among traits 
could not reflect actual experienced co-occur- 
rence of traits in the sample of photos, Some 
of the correlations were substantial and a 
factor analysis produced four major factors, 
three of which were readily interpretable. 

Mulaik (1964) used a more traditional ap- 
proach. Using the same 20 traits he asked 
different groups of subjects to rate (a) real 
people or (b) stereotyped roles, He also had 
the 20 traits rated for meaning using a se- 
mantic differential technique. Three factors 
accounting for 60% of the variance in the 
semantic ratings were quite similar to factors 
from the first two studies (people judgment), 
It could therefore be argued that the ratings 
of people and roles were heavily influenced by 
linguistic factors, 

The question naturally arose as to whether 
personality ratings could in part be produced 
by implicit personality theory of the raters 
and not by actual co-occurrence of traits in 
the ratees. D'Andrade (1965) showed that 
ratings of similarity of trait names produced 
a factor structure close to one arrived at from 
actual ratings of people. Passini and Norman 
(1966) had perceivers nominate stimulus per- 
sons they hardly knew on a variety of dimen- 
sions used earlier by Norman (1963). Factor 
analvses of such ratings produced a structure 
which was strikingly similar to one produced 
by ratings of stimulus persons well known to 
the raters. Later research by Norman and 
Goldberg (1966) and Hakel (1969) showed 
that similar results could be produced in the 
complete absence of ratees. 

Lay and Jackson (1969) also addressed this 
realism question. They took items which were 
highly correlated with total scores on 21 
major dimensions of the Personality Research 
Form (Jackson, 1967), a form used to mea- 
sure Murray needs. Subjects were asked to 
assess the probabilities of a subject answer- 
ing one item true, given a true or false an- 
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swer for another item. Thus distances were 
generated among all 21 items and a multi- 
dimensional scaling analysis done. The first 
three factors corresponded highly with the 
first three factors obtained on analysis of 
subjects’ actual test scores. The argument 
was that since there are correspondences be- 
tween a measure of real personality and per- 
ceived personality, implicit personality theory 
must in part reflect the real world. Unfortu- 
nately, even this conclusion was not com- 
pletely justified. When subjects filled out the 
questionnaires, they acted as perceivers of 
their own behavior; hence the measures of 
"real" personality may also be biased by im- 
plicit personality theory. 

Another line of research bears on the real- 
ism issue, Chapman (1967) has shown that 
subjects assume high co-occurrences among 
stimuli which are linguistically related, a phe- 
nomenon he called illusory correlation. Chap- 
man presented pairs of words such that there 
were no systematic co-occurrences among 
them, but subjects recalled many more co- 
Occurrences of associatively related words 
than actually occurred, In later studies, Chap- 
man and Chapman (1967, 1969) found the 
same phenomenon for the relationship between 
symptom statements and certain classic signs 
on the Draw-a-Person Test and for Rorschach 
signs and judgments of homosexuality. Symp- 
toms which were associatively related to these 
projective tests were inferred from those signs 
by naive subjects to the same extent as by 
trained clinicians, Thus it appeared that in- 
ferential relationships among different kinds 
of information might have been systematically 
biased such that pieces of information which 
"should" go together were perceived as in 
fact co-occurring, 

Finally a provocative recent paper by 
D'Andrade* showed that in several studies 
the matrix of correlations among classes of 
behaviors rated immediately after social inter- 
action did not correspond highly with the 
matrix of correlations when the classes of be- 
haviors were rated from memory some time 
later. The “memory matrix” was, however, 
ав С. D'Andrade. Cognitive structure and judg- 
ment. Paper prepared for the Committee on Basic Re- 


search in Education workshop on Cognitive Organiza- 


tion and Psychological Processes, Huntington Beach, 
California, 1970, 


closely related to a matrix of linguistic simi- 
larity while the “immediate rating matrix” 
was not. This suggested that trait ratings 
made from memory were likely to be dis- 
torted by feelings of what ought to go to- 
gether. 

The data on the realism issue are not out- 
standing but what we have suggests that per- 
ceived trait relationships deviate from our 
best estimates of the “true” relationships. 
Future work is likely to be addressed to the 
question of what factors bias implicit person- 
ality theory, in what direction and in what 
magnitudes. Such research is likely to dis- 
cover that there is a variety of situational and 
cognitive set factors which bias implicit per- 
sonality theory in addition to cultural and 
linguistic factors. " 

LANGUAGE AND IMPLICIT PERSONALITY 

THEORY 


While in principle there may be many “bi- 
asing” factors which affect implicit person- 
ality theory, in fact, there has been concen- 
tration on only one, linguistic determinants 
The claim is made and indeed rarely ques- 
tioned that various kinds of linguistic struc- 
tures produce the implicit personality theory. 
It has been shown that semantic differential 
ratings of trait names produce a linguistic 
structure highly similar to the structure for 
other concepts (Kuusinen, 1969b). Other re- 
searchers have also obtained factors which are 
similar to semantic differential factors (Ped- 
ersen, 1965), and Rosenberg and Olshan 
(1970) were able to predict the multidimen" 
sional configuration of traits from factors 
which are quite similar to the Osgood factors 

Most factor analyses of trait names havê 
not produced all of the typical Osgood i 
tors, but there has been a tendency to id 
a large first factor which has been interpretec 
as primarily evaluative (Lay & Jackson, 
1969; Levy & Dugan, 1960; Pedersen, 1965; 
Walters & Jackson, 1966). Of course, the 
early research on halo effect paid particular 
attention to such factors. 

Most recently there has been a substa ы 
controversy about the nature of eala 
bias in person perception. The problem 15 pes 
in trait ratings there are generally denotativ 
characteristics of the factors in addition (0 
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connotative characteristics. For example, Wal- 
ters and Jackson (1966) noted that their 
first factor is characterized by high ratings for 
items concerned with interpersonal warmth 
and sociability, but it is also the case that all 
the socially desirable terms load positively 
and the socially undesirable terms load nega- 
tively on this factor. Of course, warmth and 
sociability are themselves evaluatively loaded 
terms so this apparent confounding between 
evaluation and content is not surprising. 

The first researcher to devote considerable 
attention to separating out these aspects of 
Meaning was Peabody (1967). He organized 
traits into groups of four, Each group was 
arranged into a 2 X 2 block with descriptive 
and evaluative similarity being the relevant 
dimensions, Thus each trait (e.g, cautious) 
Was associated with a trait which means much 
the same but with opposite evaluation (e.g., 
timid) and with two traits of opposite de- 
Scriptive meaning, one positive (e.g., bold) 
and one negative (e.g., rash). Subjects were 
then asked to make inferences from various 
traits to scales consisting of a positive trait 
and its negative descriptive opposite (e.g., 
cautious-rash or timid—bold). Peabody showed 
that the overwhelming percentage of infer- 
ences were in the direction of descriptive 
rather than evaluative similarity. That is, sub- 
jects inferred that a cautious person would 
be more likely to be timid than bold. Further- 
more, factor analyses of correlations among 
the individual trait across scales did not pro- 
duce an evaluative factor although factor 
analysis of the scales (correlated across 
traits) revealed factors in which descriptive 
and evaluative aspects were confounded. Pea- 
body argued that evaluative aspects of mean- 
ing were generally secondary to descriptive 
aspects and that frequently negative traits 
Were more extreme versions of positive ones. 
For example, the negative газй described a 
More extreme version of boldness than bold; 
Similarly, timid was a more extreme version of 
Cautious, age 

Recently, Peabody's research has been criti- 
Cized on two counts. Rosenberg and Olshan 
(1970) argued that the factor analyses based 
On correlations of traits across scales inad- 
Vertently discarded evaluative variance. Suchi 
^ correlation reflected only similarities in 
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shapes of profiles and discarded differences in 
means for traits. These authors found that a 
reanalysis of Peabody's data based on profile 
distances provided dimensions which were in 
part evaluative. In a reanalysis of his earlier 
data Peabody (1970) concurred that the origi- 
nal study underestimated evaluative effects at 
the dimensional level. 

A second set of criticisms was provided by 
Felipe (1970). He argued that Peabody's 
traits were highly related descriptively and 
hence the evaluative dimension may have been 
submerged. He showed that indeed the evalua- 
tive similarities seemed to appear more criti- 
cal and descriptive similarity less important 
than Peabody had thought. Peabody (1970) 
in reply had modified his theory to suggest 
that in relationships between traits evaluative 
effects are likely to be constant while descrip- 
tive effects vary. If they are stronger than 
evaluative effects, they will seem to predomi- 
nate, 

A somewhat different approach to the whole 
question of evaluative bias came from the 
work of Kuusinen (1969a). He used a simple 
but elegant technique to partial out evaluative 
(and also potency and activity) aspects of 
trait ratings. The resultant factor analysis 
produced factors which differed in significant 
ways from the unpartialed structure. The au- 
thor claimed that the unpartialed matrix had 
greater psychological significance. 

Arguments about linguistic determination 
of implicit personality theory could continue 
forever. As argued earlier in this paper, im- 
plicit personality theory related to known 
others and using relevant traits might be 
less influenced by linguistic factors than stud- 
ies using unfamiliar stimulus persons and 
traits, This problem has not been studied ex- 


plicitly. 
INDIVIDUAL DIFFERENCES 


The claim has been made that the search for 
personality correlates of implicit personality 
theory is bound to fail since there are few 
individual differences among perceivers in im- 
plicit personality theory (Kuusinen, 1969b). 
This claim rests on the interpretation of fac- 
tor analysis data, where Kuusinen attempted 
to show that the congruence among factors 
derived from individual subjects was high. 
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This conclusion was disputed by Hamilton 
(1970) as being based on a too lenient cri- 
terion of what constitutes factor congruence. 
Also Hamilton argued that the Kuusinen data 
gave evidence of both individual differences 
and commonality. Using a more stringent 
(and more widely accepted criterion), Hamil- 
ton showed that there are extensive individual 
differences, both in the Kuusinen data and in 
new data collected by Hamilton. 

The research on individual differences can 
be divided into two groups. Some researchers 
have isolated individual differences in im- 
plicit personality theory and then have tried 
to assess how these groups of perceivers dif- 
fer. A second group has begun with a person- 
ality variable and has attempted to see 
whether people who differed along the person- 
ality dimension also varied in their percep- 
tions of others. 

Those studies which have used the first 
general procedure have not been noticeably 
successful. The most common technique has 
been to use an individual differences model of 
multidimensional scaling. This model allows 
one to separate out of the trait implication 
matrix groups of subjects who differ in their 
distances among traits. While theoretically 
powerful, in practice the result is to find a 
major point of view with one or more groups 
of deviant subjects who differ in relatively 
small ways from the first group. Furthermore, 
the points of view do not seem to be strongly 
related to personality variables. 

Walters and Jackson (1966) found two 
points of view with one being generally char- 
acterized as making less extreme judgments 
than the other. There were almost no person- 
ality correlates. Pedersen (1965) located 
three points of view and one point of view 
seemed to be related to authoritarianism. Mes- 
sick and Kogan (1966) found several points 
of view, some of which seemed related to 
broad versus narrow category use, but again, 
the personality data were weak. For the 
moment it appears that classic personality 
variables are not strongly related to the dif- 
ferent points of view isolated in multidimen- 
sional scaling. 

Some studies have relied on correlational 
analyses to isolate individual differences. 
Posavac (1971) presented subjects with pairs 


of traits and had them indicate their prefer- 
ences. From the Subject x Trait-Pair matrix 
correlations among subjects were calculated 
and seven subject factors isolated. Interpreta- 
tions for each subject factor were made in 
terms of personality data, While the correla- 
tions were quite low, interpretations were 
possible for some dimensions. Hamilton and 
Yellig î did factor analyses for separate sub- 
jects and they found not only low congruence 
among subjects but high congruence for each 
subject over time. Thus it would seem that 
the fact that personality variables are not 
related to individual differences is not a func- 
tion of reliable dimension determination. 
There is a second way to approach these 
problems. One might select a priori groups 
and see how their perceptions differ. Wart 
and Knapper (1968, Experiment 1) and Mes- 
sick (1956, 1961) have used this approach 
with mixed succe 
At a more microscopic level there are sev- 
eral reports of individual differences in trait 


inferences. For example, Steiner (1954) ae 
Warr and Sims (1965) found that high 


authoritarians make more extreme inferences 
from traits which are extremely positive or 
negative. Jones (1954) found that high at- 
thoritarians are highly responsive to powe" 
cues in stimulus persons. Warr and Knappe" 
(1968, Experiment 14) and Benedetti and 
Hill (1960) showed that personality differ- 
ences relate to extremeness of inference for 
some cue traits. Shapiro and Tagiuri (1959). 
Warr and Knapper (1968, Experiment 14); 
and Lay (1970) have all found that women 
are more inclined to make extreme inferences 
from certain stimuli than men. Matkom 
(1963) found that maladjusted subjects are 
more certain with positive information than 
adjusted subjects. Finally, Secord and Ber- 
scheid (1963) found minimal differences be- 
tween prejudiced and unprejudiced subjects 
in their attributions from stereotyped traits 
to black and white stimulus persons. 

It is relatively easy to show that individuals 
differ in their implicit personality theory, but 


7 Reported in D. L. Hamilton. Implicit personality 


theories: Dimensions of interpersonal Saga s 
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Paul F. Secord, Recent approaches to were, A 
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there has been limited success relatin: 
differences to traditional personality 
This may, of course, be more damaging to 
traditional personality variables than im- 
plicit personality theory. Future research 
might concentrate more on analyses of single 
perceivers, an approach used ingeniously by 
Rosenberg and Jones (1972) to study the 
novelist Theodore Dreiser. Greater attention 
might also be given to differences among per- 
ceivers in what categories they use or prefer 
to use to judge others, a literature reviewed 
in the next section. 


CONTENT QUESTIONS 
Nontrait Aspects of Implicit Personality 
Theory 

Most of the research reviewed thus far has 
concerned the relationship of traits or, occa- 
sionally, attitudes. There is no reason why 
any attribute of persons can not be used in 
an implicit personality theory paradigm, In- 
deed, as will be argued presently, this would 
More than likely broaden the conception of 
implicit personality theory considerably. 

There have been several lines of research 
which have not generally been thought of in 
implicit personality theory terms but which 
obviously fit the paradigm. 

Facial features. One main tradition concerns 
the inferential relationships between facial 
features and traits and is best represented by 
the work of Secord (1958). There was con- 
siderable agreement among perceivers on 
which traits they were willing to attribute to 
particular photographs, and variables such as 
darkness of skin, the use of cosmetics, and 
curvature of the mouth led directly to person- 
There was also some evidence 


ality inferences. e 1 
ain physical traits 


that subjects inferred cert 
from personality descriptions. 
Emotions. A second line of » 
usually subsumed under the topic of 
Derception accuracy, and it relates to the 
Whole question of inferring momentary states 
Such as emotion from physical, primarily fa- 
Cial cues, Although the emphasis has tended 
to be on the accuracy of inference and cross- 
Cultural constancies in emotional expression 
and perception, there is no reason why such 
àn approach could not be used to study im- 
Dlicit personality theory. Another variety of 


research is 
person 


such research is represented һу work on the 
scaling of facial expressions (Frijda, 1969). 
Since dimensions of such expressions are gen- 
erally interpreted in terms of personality and 
emotional attributes, the work has some rele- 
vance to implicit personality theory. 

Stereotypes. Another kind of research 
which has an obvious relationship to implicit 
personality theory is the work in stereotyping. 
As Hastorf et al. (1970) indicated, stereo- 
types were one kind of implicit personality 
theory and, indeed, race could be viewed as a 
classic central trait for socalled stereotyped 
traits. A recent review of this literature 
(Brigham, 1971) indicated that race is a 
powerful cue in trait attributions. One im- 
portant question is whether the structure of 
trait relationships is different for say black 
and white stimulus persons. Secord and Ber- 
scheid (1963) found no evidence of this. 

Other attributes. It should be indicated 
that there have been relatively few studies on 
the inferences made from socioeconomic 
variables, physical attractiveness, style of 
dress, etc. Gibbins (1969) did find that style 
of dress led to inferences about people. Shy- 
ness, occupations, number of boyfriends, and 
smoking were among the characteristics dif- 
ferentially attributed to wearers of particular 
styles of clothing. Expensive suits, attractive 
figures, beards, and love beads seemed to 
lead to powerful inferences in everyday life. 

Finally, there have been few studies on the 
relationships between behaviors and traits. 
Attribution research (Hastorf et al, 1970) 
has been concerned primarily with conditions 
which allow one to separate situational and 
personal causality, but there may simply be 
behaviors which are thought to be “revealing” 
and of themselves. Nail biting implies 
ousness, for example, at least for many 
perceivers. And, of course, classic psychoana- 
lytic personality theories have suggested 
classes of behavior, which ought to imply 
certain broad personality syndromes. Again, 
there has been no research attention to rela- 
tively stable inferential relationships between 
behaviors and traits. 


in 
nerv 


Units of Person Cognition 


The fact that most studies of implicit per- 


sonality theory have used traits may be 
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largely an historical accident resulting. from 
the importance of traits in personality theory 
and personality assessment research, and from 
the fact that much of the early research in 
the field was stimulated by Asch's impression 
formation research where traits had been used 
merely as a convenience. The fact of the 
matter is, we have little or no hard evidence 
on what the natural units of person cognition 
are. 

A more general point is that we have, at 
present, no basis for determining the circum- 
stances under which perceivers prefer one kind 
of trait to another or for that matter whether 
and when perceivers feel comfortable with 
traits as opposed to more behavioral or typo- 
logical units, Given that a person can be de- 
scribed behaviorally (“Не can't look another 
in the eye"), in trait language (^He's shy"), 
or in a language of typology (“He's intro- 
verted"), when do we choose to use a particu- 
lar level? While attribution research addresses 
this question, it could be argued that so long 
as such research provides perceivers with 
categories of explanation, it can never answer 
the question of which categories are “natu- 
rally" used by perceivers. Attribution research 
as well as implicit personality theory would 
be improved by an analysis of what units 
people use under different circumstances. It is 
worth noting that Bruner and Tagiuri made 
the same plea in 1954. 


Category Relevance 


An issue which seems to pop up in 10-уеаг 
cycles is that of trait relevance. It has been 
argued (Hastorf, Richardson, & Dornbush, 
1958; Kelly, 1955; MacLeod, 1947) that re- 
sults may be biased in unknown ways when 
subjects are forced to use experimenter-pro- 
vided categories. 

One of the major problems faced in such 
research is how to code subjects’ responses. 
Kelly (1955), of course, had subjects make 
responses in terms of traits and their oppo- 
sites, Rosenberg (Rosenberg & Sedlak, 1972) 
also had subjects respond in discrete units, 
and some reduction into trait categories was 
used. Dornbush, Hastorf, Richardson, Muz- 
zey, and Vreeland (1965), Beach and Wert- 
heimer (1961), and Yarrow and Campbell 
(1963) have all developed coding categories. 


There are several questions involved. with 
relevance, The first question is whether the 
relevance of categories or traits is more à 
stable set which the perceiver brings to any 
person description task or whether it varies 
with the stimulus person and the situation. 
Dornbush et al. (1965) reported that a given 
perceiver more likely uses the same categories 
in assessing two stimulus persons than two 
perceivers describing the same stimulus per 
son. Yarrow and Campbell (1963) more 1N- 
formally reported similar results. On the other 
hand, Beach and Wertheimer (1961) reported 
low reliability of category usage in a test- 
retest measure; in the retest, however, sub- 
jects could use different stimulus persons and 
many did. Ў 

A closely related question concerns the reli- 
ability of categories, constructs, and traits. 
Yarrow and Campbell (1963) reported 10% 
reliabilities of categories over time. Fjeld and 
Landfield (1961) have shown high repeat use 
of constructs for the same stimulus persons 
over a 2-week period. Mitsos (1958) found 
that if the same role figures were used, the 
same constructs were used over à 3-mont 
period, but the reliability was lessened if dif- 
ferent stimulus persons were used in the sec- 
ond administration. 

A third question—whether categories vary 
with perceiver and stimulus person character- 
istics, Yarrow and Campbell (1963), Beach 
and Wertheimer (1961), and Richardson, 
Hastorf, and Dornbush (1964) all report dif- 
ferential category usage by various groups 0З 
people for different kinds of stimulus iei 
and in some cases for interactions D" 
these variables, At present, there seems to be 
no general way to summarize these resu ts. 

A fourth question is whether relevant traits 
are used differently than nonrelevant traits. 
Koltuv (1962) showed that subjects assume" 
higher relationships among more relevant Hum 
among less relevant traits. А variety of. rd 
planations is possible for this effect, but 3^ 
the finding of Warr and Knapper (1968, je 
periment 3) that test-retest reliability ple 
scales was positively related to how applet е 
the scales were to a given stimulus person, We 
results of Koltuv may simply reflect t 1 
greater reliability of relevant scales. Gromit i 
and Caldwell (1962), Mitsos (1961), L2" 
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field (1908), and Tajfel and Wilkes (1963) 
have also shown that subjects made more ex- 
treme judgments on constructs they provided 
for themselves. The effect was not replicated 
by Warr and Coffman (1970). If, however, 
the effect was generally true, the Koltuv re- 
sults could also be explained in terms of 
greater variance for the relevant scales. 

The work of Wiggins et al. (1969) and 
Hamilton and Gifford (1970) can also be 
thought of in a relevance framework. These 
researchers presented subjects with profiles 
which differed along several dimensions. Sub- 
Jects rated each profile on some criterion or 
rated the similarity of each pair of profiles 
(Wiggins & Hoffman, 1968), and it was possi- 
ble to determine which dimensions relate most 
closely to the profile ratings. Thus it was pos- 
Sible to determine empirically which of sev- 
eral dimensions subjects or any group of sub- 
jects actually used. It was also possible with 
this technique to isolate groups of perceivers 
who weight cues in the same manner and de- 
termine how these groups of perceivers differ 
(Wiggins et al., 1969). This technique is po- 
tentially a powerful one for isolating cues 
which are relevant not in the sense of being 
used in descriptions but in the sense of being 
informative for different perceivers. 


CONCLUSION 


Several conclusions emerge from this re- 
view, First, researchers have not availed 
themselves of all the possible techniques for 
assessing implicit personality theory. Second, 
there has been an emphasis on the dimen- 
Sional aspects of trait similarities to the rela- 
tive exclusion of questions concerning the 
Content and dynamic qualities of implicit 
personality theory. The sophistication of 
methods is greater than the sophistication of 
the substantive questions. 
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REACTANCE AND THE U 


"WILLINGNESS TO HELP OTHERS ' 


LEONARD BERKOWITZ = 


University of Wisconsin 


A demand, explicit or implicit, to help someone, and even a felt obligation to 
do this, is often resented because the demand or obligation is a bothersome 
threat to the individual's freedom of action. Evidence is cited in accord with 
Brehm's reactance theory indicating that increased pressure to aid a person in 


need at times reduces the individual's willingness to help the person who is 
AC 


dependent upon him. Experiments demonstrate that this nce effect" is 
lessened when the individual is in a good mood, and is increased when he is 
self-concerned and when the help request seems improper or unwarranted. 
People might be induced to act helpfullv in spite of the threat to their in- 
dependence by procedures that clearly establish helpfulness as socially desirable 


conduct and provide practice in behaving helpfully. 


In March, 1964, Catherine Genovese was 
stabbed to death in three separate attacks 
lasting more than half an hour even though 
thirty-eight respectable, law-abiding citizens 
knew of her plight. Not one person tele- 
phoned the police during the assault; one wit- 
ness called after the woman was dead. Com- 
menting on the incident later, Rosenthal 
(1964) attributed the failure to help Miss 
Genovese to “а widespread indifference." Only 
under certain situations, he believed, would “а 
man step out of his shell toward his brother 
[p. 81]." 

We can offer a very general and quite ob- 
vious reason why man doesn't readily help 
his brother: The effort entails some cost, real 
or anticipated, and he is often unwilling to in- 
cur this cost. But to say this much is clearly 
not enough. We must determine what kinds of 
costs are involved in helping other people, 
and also explain why help is at times given 
in spite of these costs. 

The well-known experiment by Darley and 
Latané (1968) illustrates the reluctance to 
help someone in need even when the effort re- 
quired is not too great. The undergraduates in 
this study, supposedly assembled in groups of 
two, three, or six members were slower to 
call for help when one of their discussion 
partners had a (staged) epileptic seizure, the 


1 The author's research and the preparation of this 
paper were aided by Grant GS-1890X from the 
National Science Foundation. 

? Requests for reprints should be sent to Leonard 
Berkowitz, Department of Psychology, University of 
Wisconsin, Madison, Wisconsin. 


larger the number of other bystanders in the 


group. In other words, there evidently was 4 
stronger tendency to “let George do it” the 
more Georges there were in the situation who 
might also help the victim. The investigators 
speculated that the subjects felt a greater dif- 
fusion of responsibility with the increasec 


We d 
number of supposed bystanders who coul 
also do something.” i 
7 Alli 0 
Why were the undergraduates unwilling 


aid the other person? They apparently recos- 
nized an obligation to help him, but even 50, 
many of the subjects did not want to соте to 
his assistance, and rationalized away their ob- 
ligation if other people were present. The 
other bystanders provided a welcome alibi. ў 
The subjects could have been afraid dte 
number of things. As Latané and Dari 
(1970) noted, the students might have ү 
reluctant to intrude into someone else' R is 
sonal affairs. The stranger's epilepsy WAS ie 
private problem and they did not want (0 ben 
barrass him. They probably also did not be 
to act inappropriately in this strange E 
tion. Not knowing what to do, they woul б 
nothing if other people were around. Suppor: 
ing this possibility, subjects in a later bo 
ment by Schwartz and Clausen (1970) 


sartz 
" . i hwar 
? [n a later replication of this experiment pr 
and Clausen (1970) obtained a significant s 


y- 
E s = А i er of РУ 
in speed of helping with an increased numb than 


е М á er 
standers only in the case of females тае айй 
males. The findings in this later study also re- 

Я E nly the 


that the addition of bystanders affected o 
porting of the emergency to others and no 
ject's likelihood of taking direct action. 


t the sub- 
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ported the emergency much more quickly ií 
they knew exactly how to help the victim than 
if they did not have this information. Further- 
more, as Latané and Darley (1970) also sug- 
gested, the subjects could also have been wary 
of. the legal entanglements that might have 
arisen if they took action. All of these fears 
undoubtedly existed. Latané and Darley also 
found that subjects were much quicker to aid 
the epilepsy victim if the other bystander was 
a friend rather than a stranger, perhaps be- 
Cause the friend’s presence helped alleviate 
Some of these anvieties. 

This situation may also arouse another type 
Of motive: a desire not to get involved with 
Someone else because the involvement lessens 
one's freedom of action. Going to the victim's 
aid might mean having to stay with him for 
Some time, Perhaps it would require a con- 
tinued commitment to help him. More gen- 
erally, the victim has some control over his 
Potential helper to the extent that this latter 
регзоп feels pressure to aid the individual in 
need," and the subjects might even have re- 
Sented the obligation they felt to help the 
Victim, Responsibility for another person's 
Welfare is also a pressure threatening one's 
freedom, 

This resentment can be important when 
Someone needs help. Whatever other costs 
might be anticipated in such situations—the 
risk of bodily harm, unwanted muscular ef- 
Ort, the fear of intruding, and so on—many 
People are often unwilling to assist someone 

€cause his need for assistance also implies а 
Othersome restriction of their independence. 

is reasoning closely parallels Brehm's 
(1966) theory of psychological reactance. 

rehm maintains that “reactance” is gen- 
"rated by threats to the individual's behav- 
‘oral freedom. The result, Brehm says, is hos- 
“ity, as well as an increased desire to do 
Whatever it is that the individual believes he 
May not be able to do. In general, then, re- 
ACtance jg aroused when a person learns 
у 50теопе is dependent upon him for help; 

d this reactance could lessen the potential 

ыз willingness to aid the dependent in- 

ual. 


ople 1 NETS 

whe Dopler and Bateson (1965) phrased this qe 

depe; they entitled a study of the tendency to help 
dent others “the power of dependence. 
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An experiment by Latané and Darley 
(1970, pp. 81-85) can be interpreted as pro- 
viding evidence for reactance. Male college 
students who were waiting to be interviewed 
for an opinion survey individually overheard 
a fight between two children in the adjoining 
room. A bully seemed to be picking on a much 
younger child. In one condition the subjects 
were led to think the children were alone, 
while in another group they had been in- 
formed that another adult was in the room 
with the youngsters. The researchers assumed 
that the subject would feel some responsibility 
to stop the fight if he were the only adult 
around, and would think himself less obliged 
to intervene if the other adult in the room 
with the children could stop the fight. The 
felt pressure to intervene in the former con- 
dition apparently was unpleasant and mo- 
tivated the men in this group to deny their 
responsibility. When they were later asked to 
say what had been going on in the next room, 
fully 88% of the men in the condition having 
another adult in the room with the children 
said the youngsters had been having a real 
fight. On the other hand, if they had been 
the only adult present, so their responsibility 
was great, only 25% of them said the fight 
was real. The felt responsibility was an un- 
welcome burden, leading most of the subjects 
to deny it. 

Reactance can also be seen in an experi- 
ment by Berkowitz, Klanderman, and Harris 
(1964). The subjects were to construct paper 
boxes in behalf of a peer so that the greater 
the number of boxes they produced, the more 
help they gave him, although they did not 
stand to gain any explicit benefits for them- 
selves by working hard. In half of the cases, 
the experimenter would see their output right 
away, while the remaining subjects were told 
that the experimenter would not learn how 
much help they had given their peer (because 
their work was to be placed inside a sealed 
carton) until the end of the semester weeks 
away. The subjects were either men or women 
and the experimenter was also either male or 


female. B 
Analvsis of the increase in productivity 


irom a prior practice period revealed a sig- 
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nificant interaction among the three variables, 
sex of subject, sex of experimenter, and how 
soon the experimenter would learn of their еї- 
fort. When coníronted by an opposite-sex ex- 
perimenter, the subjects tended to work harder 
if the experimenter was immediately aware 
of their effort than if he was not. They wanted 
to gain the opposite-sex experimenters ap- 
proval if it could be gotten quickly. However, 
if the experimenter was of the same sex, they 
actually worked less hard when the experi- 
menter could see their effort at that time than 
when their work would not become apparent 
for several weeeks This same-sex experi- 
menter’s immediate awareness of their work 
was an additional pressure which evidently 
lessened their willingness to aid their partner. 
Instead of complying to the watching experi- 
menter’s “demand,” they resisted it, pre- 
sumably because of the reactance they felt. 

In this experiment the experimenter's de- 
mand was only implied. Jackson has shown 
that the experimenter's explicit request can 
also generate reactance which might lessen 
the subjects’ motivation to help him out. In 
Jackson's" study the experimenter first did 
half of the subjects а favor following a pro- 
cedure to be described later in this paper—he 
voluntarily provided each person in this con- 
dition with a soft drink. Then, some subjects 
were told that the experimenter was highly de- 
pendent upon them because he greatly needed 
"good" results from them, while other sub- 
jects were led to think he was much less de- 
pendent upon them. The performance was a 
verbal conditioning task. Jackson found that 
subjects who were aware of the reinforcement 
contingency displayed more verbal condition- 
ing the greater the experimenter’s dependency 
upon them—but only if the experimenter had 
not done them the favor earlier. Presumably 
because the prior favor had increased the 
pressure they felt, the subjects given the soft 
drink earlier actually showed /ess verbal con- 


5 Moscovici and Neve (1971) found, in accord 
with these results, the subjects were more likely to 
conform to another's expressed judgments when that 
person was out of the room as compared to when she 
Was present. 

SJ. б. Jackson: 


The impact of a favor and de- 


pendency of the reinforcing agent on social rein- 
forcer effectiveness. Unpublished doctoral d sertation, 


Wayne State Uni rsity, 1972. 


LEONARD BERKOWITZ 


ditioning under high than low experimenter 
dependence. 

As was suggested earlier, reactance might 
also be generated by the possibility of a long- 
standing commitment to help someone. The 
threat of someone's persistent control over 
the individual could be particularly bother- 
some. Jones (1970) found that college men 
were least willing to aid a dependent peer 
when his request implied that he would need 
their services for some time in the future. 

The reactance conception implies, of course, 
that resentment should increase as greater 
pressure is placed on the individual. More 
specifically in regard to helpfulne: the 
stronger the demand, explicit or implicit, that 
he aid someone, the more intense should be 
the reactance generated, But greater pressure 
should also heighten the chances of com- 
pliance (the individual will feel more ob- 
ligated to do the “right” thing) up to some 
maximum. Beyond this point the individual 
might well rebel so that extremely strong 
pressure actually produces decreased help- 
fulness, especially if he can find some justifi- 
cation for not giving assistance.* 

Schwartz (1970) asked people who had 
just given blood at a local Red Cross center 
to join a pool of possible bone marrow donors 
waiting to aid a young woman. The implicit 
pressure was manipulated in two ways: One, 
by mentioning the possible consequences. for 
this woman—from a statement that she might 
be helped (low pressure), through a middle 
level, to a third, most intense level created by 
the statement that she was unlikely to sur- 
vive without a matching bone marrow dona- 
tion and that her death would be a tragedy 
for her children. Second, the demand was also 
varied by saying the odds were either 1 in 
1,000 or I in 25 that the subject’s own bone 
marrow would match the woman’s and that 
he would actually be asked to be a donor. 
When the odds were quite small that the sub- 
ject would be a suitable donor, the greater the 
pressure produced by emphasizing the con- 
5 the 


* The consultant who had read this paper е 
journal advanced the interesting possibility that ; 35 
positive function relating pressure to matus e 
help is negatively accelerating" while "the xc 
function relating pressure to reactance" may be "pos 
tively accelerating." 
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sequences for the young woman, the more 
willingly the subjects agreed to join the pool 
of possible donors. However, if there was a 
fairly good chance that the person would 
actually be called upon, there was a curvi- 
linear relationship between the consequences 
Manipulation and the subject’s willingness to 
volunteer; the subjects were actually most 
reluctant to be a possible donor when they had 
been exposed to the greatest pressure—telling 
them that the woman probably would die 
Without a successful transplant. 


CONDITIONS AFFECTING TOLERANCE OF 
REACTANCE 


The following discussion assumes that re- 
actance is typically aroused when people learn 
that others need their aid, but that various 
situational conditions affect their tolerance of 
this reactance, Although there are some data 
in accord with this view (Berkowitz, 1969), 
u is also quite possible that these conditions 
Influence the experienced magnitude of the re- 
actance rather than the person’s capacity to 
withstand it. 

Whichever alternative is correct, people are 
Usually much more willing to aid the indi- 
Vidual requiring assistance because of factors 
beyond his control than the person whose de- 
Dendency is his own fault (Schopler & Mat- 
thews, 1965). They evidently are often un- 
Nappy about the obligation imposed on them 
>or the obligation they feel—to help the in- 
dividual in need, but may not actively resist 
the felt demand unless they regard the de- 
Mand as unwarranted or improper. 

The seemingly improper demand might even 
lead to a boomerang effect. Berkowitz (1969) 
demonstrated that there were different rela- 
tionships between the degree of a partner’s 
dependency on the subject and how much 
Work the subject did for him under externally 
2 internally caused dependency. When the 
Partner required help because of circumstances 
m ae his control, the greater his е 
Mined subject, the more work the 
Ой пыр 90 his behalf. When the Deep 

i Ceficiency had made him depende T 

Subject, however, increased levels of de 
©су were associated with a somewhat 
* level of work in his behalf. Again, it !5 


Owe 
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as if the subjects could succumb to the re- 
sentment created by the relatively great de- 
mand upon them when this demand was not 
entirely proper. 

Another experiment (Horowitz, 1968) re- 
ports direct evidence of the importance of the 
feeling of behavioral freedom in reactions to 
“improper” demands. College students were 
more willing to help a person requiring as- 
sistance because of his own inadequacies if 
they thought they had a freedom of choice 
than if they believed they were required to 
help their dependent peer. 

The freedom of choice given the subjects in 
the Horowitz study evidently had lowered 
their reactance somewhat. In other situations, 
explicitly telling subjects that they are free 
either to help someone or not to do so could 
give them permission to succumb to the re- 
actance they feel and not aid the person in 
need. According to data reported by Jones 
(1970), male undergraduates provided with 
a clear statement of their freedom of choice 
were most apt to refuse to help a peer who 
was highly dependent on them, especially if 
the request implied that the dependent per- 
son would have further need of them in the 
future. 

If the pressure to aid another were gen- 
erally bothersome, we certainly would expect 
this pressure to be even less tolerable if the 
individual were in a bad mood. Already feel- 
ing bad, demands to help someone would be 
another source of annoyance, and the pres- 
sure might be resisted whether the demands 
were explicit or only implicit. 

Evidence to this effect has been obtained in 
an experiment by Berkowitz and Connor 
(1966). College students given a success ex- 
perience later worked harder for a peer who 
was dependent on them than did other sub- 
jects who had been made to fail on this pre- 
liminary task. The “glow of success" evi- 
dently 'enhanced the subjects! motivation to 
help ‘the individual needing their assistance 
even though this latter person was not di- 
rectly connected with the success. Other find- 
ings in this study also point to increased re- 
sentment as more pressure is felt by the stu- 
dents. Thus, the subjects who had failed 
earlier and were presumably in a bad mood 
expressed more dislike for the experiment the 
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greater their partner's dependency on them. 
The high level of their partner's dependency 
on them (and the resulting greater felt de- 
mand) did not lead to the great expressed 
hostility in the case of the subjects having 
the success experience. The earlier failure ap- 
parently had lowered the subjects' ability to 
tolerate the dependent person's implied con- 
trol over them.“ 

A field experiment by Berkowitz and Ma- 
caulay also shows how situationally induced 
moods can influence a person's willingness to 
help others (Berkowitz, 1972). Individually 
interviewed women shoppers were either given 
praise and approval or only noncommital neu- 
tral responses during the course of the ques- 
tioning. Soon after the completion of the in- 
terview a male college student approached 
each woman and asked her for 40 cents for 
bus fare, explaining that he had lost his 
wallet. Other women in the control condition 
were similarly approached without the preced- 
ing interview. The interviewed-but-not-praised 
shoppers were much much less likely than 
the control women to give money to the stu- 
dent requesting aid, probably because the 
time-consuming interview had lowered their 
willingness to provide this form of help. How- 
ever, if the women had been praised during 
the questioning, their motivation to help the 
student increased; a reliably higher propor- 
tion of interviewed-and-praised than inter- 
viewed-but-not-praised women gave the money 
to the boy in need. The interviewer’s approval 
had put the shoppers in a good mood and 
had enhanced their willingness to be helpful. 

More recently, Isen and Levin (1972) dem- 
onstrated that this increased helpfulness by 
people in a good mood is not simply a matter 
of their being generally compliant. The col- 
lege men in this study who had a pleasant 
experience with one person were later more 
willing than others not having this experience 
to aid another individual soliciting their help 
(by agreeing to serve in his experiment) only 
when their behavior would supposedly benefit 
rather than distract the other people also par- 
ticipating in the ostensible investigation. Their 
presumed good feeling only made them more 

8 More recently, Isen (1970) and Isen and Levin 
(1972) have replicated and extended this finding. 
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inclined to carry out the helpful activities 
that were requested of them. 

Self-concern also lessens the chances that 
one person will help another. Several reasons 
can be advanced for this. For example, as 
Aronfreed (1970) suggested, the potential 
help-giver's empathic reactions to the person 
in need might lead him to give the required 
assistance. Self-concern conceivably could in- 
terfere with this empathy. This individual who 
thinks only of himself is not likely to experi- 
ence the anxieties felt by someone in need of 
help or to anticipate this person's joy at 
reaching his goal. In addition, of course, a 
self-concerned person is probably in a fairly 
bad mood. Whatever the exact cause, for the 
person who is preoccupied with questions of 
his self-worth, someone else's call for help, or 
the obligation to assist someone, is another 
unwelcome pressure. 

This self-concern might arise in a number 
of ways. Failure and frustration could have 
this effect, and the Berkowitz-Connor study 
might perhaps be understood in these terms. 
Berkowitz has also hypothesized (Berkowitz, 
1970) that people might react with this al- 
truism-lessening self-concern when they be- 
lieve their self-worth is being assessed. An 
experiment by Brehm and Cole (1966) illus- 
trates this. Male undergraduates were first 
given either an innocuous introduction to the 
study that played down its importance or were 
told that their chances of being a success in 
life would soon be assessed. This latter ped 
ment presumably increased the men's se - 
concern. Following this introduction, and w 
tensibly before the experiment proper A 
under way, in half of the cases another p? 
son (the experimenter's confederate) did. k 
subject a favor by giving him a soft au 
When the experiment then supposedly ers 
the confederate was assigned an arduous p. 
and the subject was told he could Help 
if he desired. If the confederate had not 
the subject a favor before, about half 0! hs 
men came to his aid when they had finis e 
their own assignment. On the other bm 
93% of the subjects helped the confede v 
who had done them a favor earlier m = d 
when the experimental situation was S8! 
relatively unimportant. 
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The previous favor had evidently aroused 
an obligation to reciprocate, and the men 
generally complied if they were not selí-con- 
cerned, When the subjects’ chances of success 
in life had supposedly been measured, how- 
ever, only about 13% of the men helped the 
person who had done them a favor earlier. All 
in all, the men who were induced to be self- 
concerned evidently resented the pressure they 
felt to return the confederate’s favor, and 
thus were least likely to help the confederate. 

Several experiments by Berkowitz and his 
students (Berkowitz, 1972) have also shown 
that heightened self-concern leads to lowered 
helpfulness. One of these studies also demon- 
strated that male and female college students 
typically become concerned about different 
things. While each subject waited for his or 
her supposed partner, the experimenter (who 
Was of the same sex as the subject) asked the 
subject for a favor, clearly indicating this was 
unrelated to the study: Would he (or she) 
score some data from another experiment? 
The subject’s output at this time provided a 
base-line measure of his (or her) typical work 
tate. After 5 minutes the experimenter said 
the other person undoubtedly was not going to 
show up, but added that they would proceed 
with part of the experiment. 

In creating the self-concern variations, the 
Subject was shown pictures of groups, and in 
two of the three conditions was asked to 
identify the leader of each group and the 
Broup attitude toward this person. The pic- 
tures were described as a test of “social sensi- 
livity" in one of these conditions, and as à 
Measure of the individual's "supervisory abil- 
Цу” in the other treatment. It was expected 
that the women would become especially con- 
cerned about their performance on the sup- 
Posed test of social sensitivity and that the 
Men would be most concerned about doing 
Well on the supervisory ability test. Finally, 
In à third, control condition, the subjects were 
Only to indicate their liking for the pictures. 

fter the subjects had written their judg- 
ae the experimenter collected them 2 
S he would go off to score these 7 ws 
th У. Each subject was asked to wee n 

* experimenter returned with the scores. 

ut while waiting, he was asked to help the 
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experimenter again by scoring the other data 
he had collected from the other study. 

Analysis of the change in work rate during 
this period from the earlier, base-line rate 
yielded the expected outcome. The men in- 
creased their work rate after taking either the 
“social sensitivity" or *preference" tests, but 
actually slowed down when they thought the 
test had measured their “supervisory ability.” 
In the case of the women, on the other hand, 
the lowest increase in work arose after they 
had taken a test of their “social sensitivity." 
АП in all, the men and women seemed to have 
somewhat different concerns, but whatever 
made them selí-preoccupied, the arousal of 
self-concern evidently lowered their willing- 
ness to aid the person requesting help. 


CONCLUSION 


Assuming that people do resent the obliga- 
tion they feel to aid those who are dependent 
upon them, then, the next question becomes, 
how can they be induced to live up to this 
obligation and be more helpful? One answer, 
of course, is to minimize those situational 
conditions that appear to lessen their tolerance 
for reactance. But it may be that more posi- 
tive steps can be taken as well. These pro- 
cedures would (a) clearly established help- 
iulness as socially desirable conduct and (5) 
provide practice in behaving helpfully. 

Although conclusive evidence of the part 
played by social ideals is still lacking, there is 
some reason to think that normative con- 
siderations affect the individual's readiness to 
come to the aid of persons in need (cf. Ber- 
kowitz & Daniels, 1964; Rawlings, 1970; 
Schwartz, 1970b). Thus, according to London 
(1970), a number of Germans who had risked 
their lives to assist Jews seeking to hide from 
the Nazis had a "strong identification with a 
moralizing parent" and firm moral convictions 
themselves. 

There is somewhat more direct evidence in 
a recent replication of the Darley and Latané 
diffusion-of-responsibility experiment carried 
out by Horowitz (1971). Using members of 
either socially oriented. groups (fraternities) 
or campus community service groups as sub- 
jects, each person was told either that he and 


one other subject were alone in the communi- 
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cation network or that there were three other 
bystanders as well. When the “other subject” 
had the supposed seizure, the service-group 
members were more likely to intervene than 
were the social-group members, especially if 
the experimental instructions had deliberately 
reminded them of their group membership. 
Furthermore, although the social-group sub- 
jects displayed the diffusion of responsibility 
effect noted by Darley and Latané, the pres- 
ence of the additional bystanders actually 
served to increase the service-group members’ 
helpfulness. These latter persons had pre- 
sumably become more aware of their group- 
shared ideals prescribing helpfulness when 
they were in the presence of others, particu- 
larly when they had been reminded of their 
group membership, and the more salient ideal 
might have induced the greater helpfulness. 

However, normative considerations alone 
are often insufficient; the pressure to do the 
“right” thing may be resented and then 
avoided, Staub (1971) suggests that children 
in our society are more apt to be taught pro- 
hibitions than rules positively calling for pro- 
social behavior, so the latter ideals may be 
relatively weak in most people. Still, if the 
individual practices adherence to the moral 
standard, he is all the more likely to act in 
conformity to it at other times (Rosenhan & 
White, 1967). In keeping with this possi- 
bilitv Whiting and Whiting (1970) reported 
that those cultures in which children tend to 
be most helpful have typically assigned im- 
portant familial tasks to these youngsters in 
the course of their socialization. Repetition of 
socially desirable behavior might well lessen 
the person’s resistance to act in this manner, 
especially when the conduct is rewarded. All 
in all, encouraging people to “do their own 
thing” and to become preoccupied with their 
own concerns is all too apt to promote selfish- 
ness rather than helpfulness ( Berkowitz, 
1970). If we want greater helpfulness, we 
probably have to teach people that this is 
desirable behavior, have them rehearse this 
action, and reward them when they do assist 
others. 
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^IMPLOSIVE THERAPY A? 


D FLOODING PROCEDURES: 


A CRITICAL REVIEW * 


KENNETH P. MORGANSTERN ? 


Pennsylvania State University 


After a brief explanation of implosive theory and technique, 


the case reports 


and experimental investigations of implosive therapy and flooding procedures 
with human subjects are critically reviewed. Much oí the research has produced 


confusing and contradictory results and these. findings 
of the type of response rehearsed in therapy, exposure 
a majority of the experimental reports are 


factors. However, since 


confounded, interpretation of the literatur 
cluded that there is, at present, no convincing evidence o 
implosion or flooding with human subjects nor is there a 
techniques are superior to systematic desensitization. In 
appears to be unsupported. Finally, 
ised in regard to the desirability oi the 


- cal basis of implosive therapy 
realistic and ethical questions are га 
clinical use of implosive procedures. 


The last decade has witnessed an explosion 
in the behavior modification literature. Of par- 
ticular importance has been the development 
of two principle techniques aimed at the re- 
duction or elimination of conditioned fear and 
avoidance behavior. One of these procedures, 
systematic desensitization, has seen the pub- 
lication of hundreds of reports which have 
proceeded from initial demonstrations of ef- 
fectiveness (e.g., Wolpe, 1958), through fac- 
torial designs (e.g., Paul, 1966), to analyses 
of treatment components (e.g, Davison, 
1968). Finally, several comprehensive and 
critical reviews (e.g., Bandura, 1969; Lang, 
1969; Paul, 1969b, 1969c; Rachman, 1967; 
Wilson & Davison, 1971) have helped in the 
integration and evaluation of the empirical 
data and in the validation of the theory under- 
lying desensitization. 

In contrast, experimental investigations of 
implosive therapy (the other major technique 
aimed at the reduction of phobic and avoid- 
ance behavior) have been noticeably lacking. 
Slightly more than 25 reports appear to date 


l'This article was written while the author was 
supported in part by a United States Public Health 
Service Fellowship. Acknowledgment is extended to 
W. Edward Craighead for his helpful suggestions and 
comments on an earlier draít of this manuscript. 

2 Requests for reprints should be sent to Kenneth 
P. Morganstern, Department of Psychology, Pennsyl- 
vania State University, 417 Psychology Building. 
University Park, Pennsylvania 16802. 


are discussed in terms 
time, and cognitive 
so badly 
e is exceedingly difficult, It is con- 
{ the effectiveness of 
ny evidence that the 
addition, the theoreti- 
serious 


dt 


with the majority of this research conducted 
by the innovators and proponents of implo- 
sion or by doctoral candidates. The studies 
have shown little evidence of the systematic 
progress in methodological sophistication that 
has characterized the development of desensi- 
tization research, nor has there been any 
integrated review published. 

This article, then, examines the case reports 
and experimental investigations of implosive 
therapy with human subjects. Animal studies 
have been excluded, except in a few cases 
where subhuman analogues have had impor- 
tant implications for theoretical validation or 
clinical practice. Finally, research in flooding, 
a similar but distinct. technique, is also re- 
viewed and evaluated. 


IuPLOSIVE THEORY AND ' TECHNIQUE 


Although slow in coming, the theory and 
the technique of implosive therapy has now 
been amply outlined in several reports 
(Hogan, 1968, 1969: Levis, 1967; Stampfl; 
1967: Stampfl & Levis, 1966, 1967a, 1967, 
1968, 1969). During the late 1950s, Thomas 
Stampfl, with the assistance of George Golias, 
Robert Hogan, and, later, Donald Levis, de- 
veloped a learning-theory-based approach ee 
treatment incorporating many of the formula 
tions consistent. with psychoanalytic iy 
Based upon an extension of Mowrer $ Б" 
factor model of learning (e.g. Mowrer, 1997; 
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1947), phobic or "symptomatic" behaviors 
are seen, according to Stampll, as end pro- 
ducts in a long chain of higher-order condi- 
tioning. These behaviors are viewed as at- 
tempts to reduce the anxiety associated with 
the more threatening thoughts and imagery 
that are supposed to have accompanied the 
original aversive conditioning. Defensive ma- 
neuvers of the patient are seen, then, as а 
means of avoiding the dangerous associations 
which elicit anxiety, even if the patient is 
unaware of them. 

The most basic associations are related to 
sex, rejection, oral and anal functions, aggres- 
sion, bodily injury, guilt, punishment, and loss 
of impulse control. Since the latter are more 
closely associated with the original aversive 
conditioning events, they are considered to be 
More important and are more likely to elicit 
the anxiety response (even though they may 
be "repressed"), than are the observable 
"symptoms." 

Important in the development of the thera- 
peutic technique was the experimental evi- 
dence of Miller (1951), Solomon, Kamin, and 
Wynne (1953), and Levis (1967) who dem- 
onstrated that extinction occurred more rap- 
idly and more fully the closer the extinction 
Situation approximated the original learning 
conditions. In addition, it was suggested that 
the more intense the fear evoked, the greater 
the anxiety decrement. Although Ѕіатрӣ and 
Levis (1967a) cite additional evidence sup- 
Porting this hypothesis, they go well beyond 
the experimental data (entirely subhuman) in 
extrapolating the principles to therapy. For 
example, Stampil (1967) hypothesizes that 
Psychodynamic themes and conflicts must be 
Present in the original conditioning situations 
and that these “stimuli” have to be included 
In therapy, The goal, then, of the implosive 
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a reproduce in the absence of physical pain, as good 
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tively high levels of anxiety. Their repetition in the 
absence of a primary reinforcer, however, should 
lead to a decrement in their av eness through 
the principle of generalization of extinction and 
thereby to a reduction in the avoidance reactions 
which constitute symptomatic behavior . . . [Stampfl 
& Levis, 1968, p. 33]. 


The procedure of implosive therapy begins 
with an initial assessment of the crucial stim- 
uli that are associated with the patient's anx- 
iety. From this assessment an Avoidance 
Serial Cue Hierarchy is constructed, with con- 
ditioned stimuli ordered according to the ex- 
tent to which they are associated with the 
original primary reinforcement, Symptom-con- 
tingent cues, involving situational and en- 
vironmental stimuli that are associated with 
the actual avoidance behavior, are low on the 
Avoidance Serial Cue Hierarchy and are the 
ones presented and extinguished first. Hy- 
pothesized sequential cues (assumed to be 
repressed) are higher on the Avoidance Serial 
Cue Hierarchy and involve dynamic themes 
such as aggression, punishment, and sexual 
material. The latter cues are also seen to be 
related to the patient's symptomatic behav- 
ior, but because they are more threatening 
and more highly avoided, they are extin- 
guished last. According to Stampfl and Levis 
(1967a), it is not important that the patient 
accept the dynamic interpretation, the em- 
phasis being simply on the extinction of all 
the anxiety-evoking cues which provide both 
motivational and reinforcing properties for 
perpetuating the patient's avoidance responses. 

During implosive therapy, the patient is in- 
structed to “lose himself" in the part that 
he is playing and “Jive” the scenes that the 
ist describes with genuine emotion and 


therap 
s an attempt 


affect. At each stage of the proces 
is made by the therapist to attain a maximum 
level of anxiety evocation from the patient by 
embellishing the scenes with the most horrify- 
ing and unrealistic images. When a high level 
of anxiety is achieved, the patient is held at 
this level until he implodes—a spontaneous 
reduction in the anxiety-inducing value of the 


cues (Stampfl & Levis, 1967a). This pro- 


cedure is continued until a significant diminu- 
lted, the therapist 


tion in anxiety has resu i 
scending the Avoidance Serial Cue 


gradually a 


320 


Hierarchy until the anxiety elicited by all 
stimuli has been extinguished. 


Literature REVIEW 


Before consideration of the appropriate re- 
search, it is important to make a distinction 
between flooding procedures and the tech- 
niques usually employed by implosive thera- 
pists. Many reports have confused the two, 
often using the terms interchangeably. There 
seems to be quite a difference, however, be- 
tween flooding in which intense stimuli are 
simply presented for an extended period of 
time, and implosive procedures that expose 
patients to unrealistic, but horrifying and 
vivid scenes related to the phobic stimuli. As 
Bandura (1969) has pointed out, "there is 
considerable difference between exposing peo- 
ple repeatedly to a fearsome collection of 
rodents without any adverse effects and de- 
picting them eating human flesh [p. 404]." 
Although research in both areas is presented 
in this review, it is separately evaluated 
(whenever possible). 


Case Histories 


A few researchers (e.g, Hogan |1963]|, 
1966; Levis, 1967; Stampfl & Levis, 1967a, 
1968) have reported high degrees of success 
using implosive therapy over a wide variety 
of psychopathological problems. These dis- 
orders have included “both psychoneurotic 
disorders such as anxiety, phobic, obsessive, 
compulsive, and depressive reactions, and p 
chotic disorders such as affective, 
phrenic, and paranoid reactions [Stampi & 
Levis, 1967a, p. 502|.” 

Hogan (1968) reported an exceptional case 
in which a severely disturbed woman received 
eight implosive therapy treatment sessions. 
The scenes that the woman was asked to 
visualize during therapy included extreme ag- 
gression, punishment, destruction, and rejec- 
tion, as well as many phobic stimuli which in- 
cluded rats, bugs, snakes, and mice. The treat- 
ment, according to Hogan, produced a re- 
markable cure although no data is presented. 


schizo- 


ЗВ. A, Hogan. The implosive technique: A pro- 
cess of reeducation through the application of prin- 
ciples of learning for emotionally disturbed indi- 
viduals, Unpublished doctoral dissertation, Western 
Reserve University, 1963. 
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However, the author reports that, in addi- 
tion to shifts on several Minnesota Multi- 
phasic Personality Inventory (MMPI) scales 
in the direction away from pathology, the pa- 
tient lost weight, was no longer depressed, 
showed better contact with reality, and was 
no longer plagued by guilt, paranoid thoughts, 
or hypochondriacal symptoms. Finally, fears 
of death, people, sex, assault, and insects were 
completely eliminated after treatment. 

Baum and Poser (1971) have recently re- 
ported two successful cases of flooding pro- 
cedures in vivo. In the first instance, a woman, 
described as a severe obsessive-compulsive, 
was afraid of contracting “cancer germs” from 
a wide variety of objects, particularly raw 
meat. One session, in which the patient ap- 
proached, picked up, and played with raw 
pork, successfully extinguished her fears. In 
the second case, a 19-year-old schizophrenic 
bov, whose main fears were being with girls 
his own age and public speaking (because of 
an obsession about “bad breath"), was also 
successfully treated in one session. The treat- 
ment consisted of playing a game with several 
young women in which he was forced to be 
very close to them for an extended period of 
time. 

Even Wolpe (1969) reported two cases in 
which he used flooding procedures. One of the 
patients was markedly improved after a single 
session, but in the second case where flooding 
procedures were employed in vivo, anxiety 
was noticeably intensified, resulting in hos- 
pitalization of the patient. Wolpe concluded 
that individual differences determine whether 
a person will be amenable to flooding meth- 
ods, or will be harmed by them, and empha- 
sized that the risks involved were not justfie Д 
except as a final resort. 

Although it is unreasonable to expect many 
therapists to report failures, Hogan (1966); 
in fact, reported an unsuccessful attempt by 
Golias and Hogan in 1961. In this instance: 
over 50 sessions of implosive therapy with 2 
29-year-old chronic schizophrenic served only 
to produce “sociopathic behavior" in the pa 
tient, from which “he never recovered. 1 

While several investigators have reportec 
overwhelming success with patients using a 
plosive or flooding techniques, some have ге- 
ported failures, making Stampll's early claim 
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of 100% success unjustified. None of these 
reports have included data, although а few 
point to “improved MMPI profiles." While 
interesting, perhaps, case histories have little, 
if any, formal scientific value, and at best 
provide very crude hypotheses about cause- 
effect relationships. All of the reports contain, 
what Paul (1969a) and Underwood (1957) 
have termed, “lethal errors” (i.e., a complete 
confounding between classes of variables) 
and, therefore, do not allow for any conclu- 
sions about the effectiveness of implosion or 
flooding. 


Experimental Research: Implosion 
A series of research investigations by 

Hogan and Kirchner offered the first and most 
Optimistic reports on the effectiveness of im- 
plosive therapy. Hogan (1966) conducted the 
initial study of implosion in the early 1960s at 
the Hawthornden State Hospital in Mace- 
donia, Ohio. Fifty psychotic patients were as- 
signed to either implosive or nonimplosive 
therapists by a ward physician who was re- 
portedly unaware that a study was being con- 
ducted, The groups were equated for sex, age. 
education, IQ, prior hospitalization, and 
length of institutionalization. All subjects par- 
ticipated, in addition to the treatments under 
Study, in group therapy. recreation, work, 
educational programs, and occupational ther- 
apy. Results indicated no significant MMPI 
Changes (the only dependent measure used) 
for subjects who were assigned to nonimplo- 
Sive therapists, whereas subjects receiving im- 
Dlosive therapy showed significant shifts away 
from pathology on five MMPI scales. As a 
further index of success, 18 of the 26 im- 
ploded subjects had been discharged from the 

Ospital at a 1-year follow-up, while only 
eight of 24 control subjects had left. 

, Kirchner and Hogan (1966) randomly as- 
Signed 40 rat-phobic college females to either 
an implosion or control condition after they 
Vete unable to pick up a rat on à pretest. АП 
"bjects received treatment together by list 
€ through individual earphones to а tape 
e E in a language ыйа, ae 
ced я- subjects listened to an imp m Ee 
MSS that described rats biting and агас 

€m in packs, while the controls were m- 
ed to imagine pleasant scenes while they 
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listened to music. Results indicated that 62% 
of imploded subjects and 26% of controls 
handled the rat on the posttest, a statistically 
miñcant difference. 

In a similar experiment, Hogan and Kirch- 
ner (1967) individually treated implosive 
therapy subjects in one session, the length of 
treatment varying from individual to indi- 
vidual (averaging 39 minutes). Control sub- 
jects, also seen one time individually (average 
treatment time = 30 minutes), were asked to 
visualize neutral imagery such as a walk or 
a relaxed vacation. On a behavioral posttest 
67% of imploded subjects and 10% of con- 
trols were able to pick up a rat, a statistically 
significant chi square. 

In the final study, Hogan and Kirchner 
(1968) randomly assigned 30 snake-phobic 
coeds to either an implosive, eclectic verbal 
therapy, or bibliotherapy condition. Implo- 
sion subjects, seen individually by the senior 
author for 45 minutes, were asked to imagine 
snakes bitting them and wrapping so tightly 
around their necks that they slowly strangled. 
Subjects in the eclectic verbal therapy group, 
all treated by the junior author, discussed 
their interpersonal relationships and their 
prior experiences with snakes, being reas- 
sured by the experimenter that snakes were 
harmless. Finally, bibliotherapy subjects were 
handed a book about snakes and their habits, 
that clariñed many of the myths and untruths 
about snakes. After one treatment session, 
70° of the implosive therapy subjects passed 
the posttreatment behavioral test in contrast 
to only 40% of eclectic verbal therapy and 
10% of bibliotherapy subjects who picked up 
the snake. Although the difference between 
implosive therapy and eclectic verbal therapy 
subjects was not significant, both of these 
groups were reliably different than biblio- 
therapy subjects. Following the posttest, the 
failures of both “control” groups were given 
one implosive therapy session. Results of this 
additional session indicated that 67% of these 
subjects then passed the approach test (al- 
though initial treatment and implosive ther- 
apy were obviously confounded). " 

Taken together, the Hogan and Kirchner 
series offer initially impressive and encourag- 
ing findings. On closer inspection, however, it 


is apparent that the investigations are so re- 
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plete with methodological deficiencies that it 
is impossible to draw any firm conclusions 
about the effectiveness of implosive therapy. 
One of the major shortcomings is the absence 
of true control groups in most of the reports. 
For example, “control” subjects reported by 
Hogan (1966) received treatment indistin- 
guishable from ordinary hospital routine. 
Similarly, listening to music (Kirchner & 
Hogan, 1966) or imagining а relaxed vaca- 
tion (Hogan & Kirchner, 1967) were rather 
transparent “pseudotherapies” no matter what 
rationale was given. Thus the possible placebo 
effects of a new and radically different treat- 
ment were left essentially uncontrolled іп 
these three designs. 

Also noteworthy is the apparent disregard 
for the effects of experimenter bias in the 
majority of these reports. It is difficult to see, 
for example, how a ward physician can equate 
treatment groups on several variables (Hogan, 
1966) and still be “unaware that a study was 
being conducted.” Likewise, experimental sub- 
jects in two studies (Hogan & Kirchner, 1967, 
1968) were observed on the posttest by the 
same experimenter who administered the 
treatment condition. Not only is bias obvious, 
but it would also seem that experimental sub- 
jects must have had extreme “demands” 
created to perform on the behavioral test, 
since their “therapist” was conducting the as- 
sessment. Hogan and Kirchner (1968) dis- 
miss this point as unimportant, citing a previ- 
ous study (Kirchner & Hogan, 1966) that 
achieved positive outcomes while controlling 
for experimenter bias and demand. The two 
studies, however, are not really comparable. 
Rather than simply listening to a tape (as in 
the earlier study), Hogan and Kirchner's 
(1967, 1968) implosive therapy subjects spent 
an entire therapy session with the same in- 
vestigator who then observed their posttreat- 
ment performance. 

Additional problems in several of the studies 
also add to the confusion in interpreting the 
results. Hogan (1966), for example, failed to 
equate the length of treatment, therapist ef- 
fects, and pre-MMPI scores in his experimen- 
tal conditions. Hogan and Kirchner's (1967) 
implosion subjects received approximately 
25% more “treatment” than controls, and the 
implosive therapy and control subjects treated 
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by Hogan and Kirchner (1968) were seen by 
different therapists. Since there is a total 
confounding of experimental condition and a 
host of other important variables in these 
three reports, one must seriously question the 
validity of such control groups. 

A number of additional criticisms of 
perimental design also deserve mention. The 
use of the MMPI (Hogan, 1966) as the sole 
dependent measure is questionable as a sign 
of pathology or of treatment. change. Also, it 
appears likely that most of the subjects in at 
least one of the studies (Hogan & Kirchner, 
1968) were not very fearful to begin with, 
since a short verbal discussion (eclectic verbal 
therapy) enabled 40% of the “snake-phobics” 
in this group to handle the feared object. 
Finally, in view of the fact that the research- 
ers have been closely associated with Stampll 
and Levis (the major proponents of implosive 
therapy), it is interesting to note the absence 
of any psychodynamic themes in their treat- 


ex- 


ment procedures, even though they report 
great success. Thus it appears that at least 


one of Stampfl’s major assumptions has been 
called into question (Rachman, 1969) by one 
of the investigators who was instrumental in 
the development of the theory (Hogan). 
Levis and Carrera (1967) randomly a5- 
signed 40 patients to an implosive therapy 
group or to one of three control conditions: 
The subjects in the implosive therapy group 
each received one or two ]-hour standard diag- 
nostic interviews, followed by 10 1-hour 5@5- 
sions of implosion. One implosive therapy 
therapist, experienced in the technique: 
treated. 7 of the 10 subjects in th 
while another experimenter inexperience, 
implosion, treated the remaining 3 subjects; 
Two control groups received «conventiona 
therapy (insight and supportive) for either ^ 
hours (CT, the same length of treatment a 
implosive therapy), ог for a much longer ге 
riod (СТ, mean length = 37 hours). 
last control group, no treatment (WT); 
mained on a waiting list, particip: 
pre- and postassessments. 
that imploded subjects showed а SI£ Ў 
greater decrease ОП 9 out of 10 stant 
scales of the MMPI (the only дере 
variable employed in the study). F bove 
authors point out, 35% of pretest score? a 


re- 
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70 (the cutoff point for pathology) for im- 
plosive therapy subjects dropped into the nor- 
mal range after treatment, while control sub- 
jects in CTi, CT», and WT decreased only 
12%, 99%, and 7%, respectively. Finally, 58% 
of the scores above 80 dropped for the im- 
plosion group (although not necessarily into 
the “normal” range), whereas no such changes 
were observed for the controls. 

While the investigation was referred to as 
only “exploratory,” several procedural errors 
iu the design prevent any but the most tenta- 
tive conclusions from being made. The failure 
to equate groups for length of treatment, cost 
of treatment (some subjects were assessed 
fees, others were not), and experience of ther- 
apists was evident. Also, different. therapists 
Were used for different treatments and the 
groups were seen, apparently, at different 
limes, thus completely confounding the major 
independent variable. Finally, even the sole 
Criterion, the MMPI, is confounded since in- 
Spection of the data reveals unequated pre- 
test scores for experimental and control 
Broups. The authors report greater decreases 
On 9 out of 10 MMPI scales for implosive 
therapy subjects. However, on 7 of these 9 
Scales, imploded subjects had greater pretest 
Scores, making a statistical explanation (re- 
gression toward the mean) as likely as any 
Other for the observed changes. Further, there 
Were no significant differences in posttreat- 
Tent scores between groups; and when im- 
Plosive therapy subjects are compared to all 
Controls combined, only two scales show sig- 
nificant pre-post decreases. One of these 
pe Mf, has little value in pathology as- 
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equated on an initial snake approach test, 
were assigned to one of five treatment groups. 
Each treatment consisted of a 40-minute tape, 
followed immediately by an approach test, 
Eysenck’s Psychoticism-Extroversion-Neurot- 
icism inventory, and a fear thermometer. 
Group 1 listened to an implosion tape, closely 
following the procedure of Hogan (1966), 
while Group 2 received 30 minutes of “hor- 
ror images" followed by 10 minutes of snake 
images. Group 3 received the same treatment 
as Group 2, with the exception that the snake 
images were presented 24 hours later. Group 
4 listened to 40 minutes of horror images, and 
finally, Group 5 listened to 40 minutes of 
pleasant imagery (kittens instead of snakes). 

The only significant findings in the above 
investigation revealed that improvement on 
the fear thermometer was limited to control 
subjects, while implosive therapy subjects re- 
ported a significant increase in fear from pre- 
to posttest. In addition, Group 2 (snake im- 
ages immediately following horror images) 
showed a significant increase in fear. These 
results are in marked contrast to the dramatic 
improvements reported by Kirchner and 
Hogan (1966) even though the implosion 
| tudies received very similar 
treatment (although the authors admit that 
the expectancies for subjects were not the 
same). The investigators concluded that the 
improvement of Group 2, but not Group 3, 
strengthens the hypothesis that nonspecific 
emotional arousal &qesensitizes” phobic stim- 
uli only if the latter are immediately pre- 
sented, but in the absence of any significant 
difference between groups on the avoidance 
test, a more stringent criterion, it is difficult 
to make any firm conclusions. À more recent 
investigation, however, lends some support to 
the notions of Hodgson and Rachman. In this 
study, Watson and Marks (1971) reported 
that imagining either relevant or irrelevant 
fear cues produced significant improvement 
on clinical and attitudinal measures, but only 
irrelevant. fear sessions produced significant 
physiological improvements. Of greater im- 
portance Was the finding that the imagination 
items significantly reduced sub- 


of irrelevant 1 
jective anxiety more than relevant fear ses- 


groups in both s 


sions. 
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Component Analysis of Implosion 


Several reports of implosive therapy have 
attempted to analyze the treatment package 
and break down the procedure into its essen- 
tial components. Such studies can potentially 
reveal more about the treatment and improve 
application, as well as test the theory itself. 
In addition, such component studies shed ad- 
ditional light, perhaps indirectly, on the ef- 
fectiveness of implosion treatment. 

Коа * investigated the effects on implo- 
sion of varying the mode of presentation of 
the phobic stimuli (continuous versus dis- 
tributed) and the duration of the session (20 
versus 40 minutes). Fifty snake-phobics were 
individually administered in a single session, 
one of five treatment conditions, with 10 sub- 
jects randomly assigned to each group. Four 
implosive therapy conditions received either 
continuous or distributed implosion, and for 
either 20 or 40 minutes. The fifth condition, 
Education Therapy entailed giving the sub- 
ject information about snakes for 6 minutes. 
Results showed that all treatments reduced 
fear behavior toward snakes using behavioral 
and subjective measures at pre-, post-, and 
follow-up testing. In addition, it was found 
that the 40-minute continuous implosion 
group was more successful than the other 
three implosive therapy conditions. However, 
the education therapy group, which had been 
"treated" for only 6 minutes, showed greater 
fear reduction than any of the implosive ther- 
apy methods. Moreover, implosive therapy 
groups manifested a temporary increase in 
fear after therapy toward certain previously 
neutral stimuli, an effect absent in the educa- 
tion therapy group. Although this study in- 
volves a few problems (e.g., the possibility of 
experimenter bias in the assessment proce- 
dures and the absence of a no-treatment con- 
trol), the author’s admonition for caution in 
using implosive therapy seems well taken. 

Fazio (1970), in two experiments, at- 
tempted to evaluate the separate effects of 
some of the treatment components of implo- 
sive therapy. In Experiment I, 18 “roach- 


* R. R. Kotila. The effects of education and four 
varieties of implosive therapy on fear of snakes. Un- 
published doctoral Washington State 


dissertation, 
University, 1969 
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phobics" were randomly assigned to one of 
three. treatment conditions: (а) implosive 
therapy, in which subjects listened to a tape 
patterned after Kirchner and Hogan (1966); 
(b) “pseudotherapy,” which involved visualiz- 
ing scenes of irrelevant objects, followed by a 
tape of a frank and rational discussion of um 
relevant. phobic stimuli (which controlled for 
the novelty of imagining frightening scenes); 
and (c) a ‘combination of Treatments a and b. 
All subjects participated in three 29-minute 
sessions, each held at 2-week intervals, The 
results indicated that implosive therapy was 
not effective in producing any significant re- 
duction in phobic behavior. Indeed, both post- 
test and follow-up data showed that reduction 
in fear on most of the behavioral tests was 
inversely related to the amount of implosive 
therapy experienced by treatment groups, wa 
is, Group 1 showed more fear than Group ~ 
which, in turn, showed more fear than 
Group 2. 


In the second experiment by Fazio (1970), 
29 subjects were randomly assigned to the 
same three treatment conditions of Experi- 
ment I. After two pretests in which each sub- 
ject attempted to approach a cockroach, three 
treatment sessions were conducted at 1-week 
intervals, using the identical tapes as em 
ployed in the first study. After completing Ha 
phase, each subject then received two addi- 
tional sessions of a different treatment. RES 
sults again showed that implosive qe 
subjects failed to show any signiñcant P. 
provement. In fact, three imploded d 
who were not considered phobic on the ит 
of their pretests, became fearful of the e H 
lus after only two implosive therapy pee 
leading the author to conclude that pem. 
cussion component was the most impor 
factor in producing any improvement. 

Prochaska (1971) tested one of th 
hypotheses of implosive therapy by р 
ing test-anxious college students with s 
tom cues" (e.g, studying for an exar 
dynamic cues (e.g. 


castration). Results showed that th 
ficantly 


) or 


parental 


tom cue group indicated signi t 
improvement on an intelligence pi 
major dependent variable) than pos 
controls, while the dynamic cue 
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fered reliably from only a no-treatment con- 
dition, the symptom and dynamic cue groups 
not differing from each other. In addition im- 
ploded subjects were reported to experience 
less anxiety on their final exams following 
treatment and showed modest increases in 
their grades, The study, however, contains 
Several problems in design, the most important 
being the complete confounding of experi- 
mental conditions with a variety of extraneous 
uncontrolled variables. For example, some of 
the subjects were tested in a large group, some 
in small groups, and some individually. Like- 
Wise, treatment conditions varied, with some 
Subjects listening to a taped session through 
earphones in a language laboratory, while 
Others sat around in a small group listening 
together to a tape recorder; some groups were 
as large as 23, others were as small as 3. 
Finally, the use of the scores of the intelli- 
Sence tests as criteria for reduced test anxiety 
is highly questionable, particularly in view 
9f the fact that the small changes that were 
Teported were accompanied with neither men- 
lion of reliability nor standard error of mea- 
surement. i 


omparative Studies with Systematic 
esensitization: Implosive Research 


Both implosive therapy and flooding have 
y to systematic 


Je е 
1 €n compared experimentall 
“esensitization in several studies. From such 
be resolved, 


Btadua, (counterconditions) anxiet 
sup ated hierarchy, with the result that the 


Je: x 1 E 
trea ^ experiences very little anxiety during 
foo a On the other hand, implosive and 
сіру techniques based on extinction prin- 


es Е а 
lor pr Dect the patient to intense anxiety 
Nen а periods of time. : 

"ck (1968) and Wolpe (1958: Wolpe 
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& Lazarus, 1966) predicted that subjects who 
were presented with anxiety-eliciting stimuli 
that were too intense (e.g., in implosion) 
would actually become “sensitized” to the 
stimuli and levels of anxiety and avoidance 
behavior would increase. Stampfl and Levis, 
in contrast, would argue that if subjects did 
not experience intensive emotion, then little 
extinction would result. It appears, then, that 
the generally accepted theories underlying 
each technique cannot adequately account for 
the successes of the other treatment. Com- 
parative studies, therefore, will not only an- 
swer questions of “economy” and efficiency 
but will also provide very important data for 
the different theoretical assumptions. 

To date, nine studies have appeared test- 
ing the relative efficacy of desensitization and 
implosion. Three of these investigations have 
shown systematic desensitization to be su- 
perior to implosive therapy (implosion sub- 
jects either not differing from controls or ac- 
tually becoming worse after treatment), with 
the remaining six showing no significant dif- 


ferences. 

A well-designed study was performed by 
Mealiea (|1967|, Mealiea & Nawas, 1971) 
who randomly assigned 50 snake-phobic coeds 
to one of five groups (systematic desensitiza- 
tion, implosive therapy, and three controls) 
after they were matched on a behavior test 
and a fear thermometer. Results indicated 
that the systematic desensitization group sig- 
nificantly showed the greatest decrease in 
snake-phobic behavior, while imploded sub- 
jects did not differ from any of the controls. 
A 1-month follow-up revealed that systematic 
desensitization subjects continued to exhibit 
the greatest decrease in phobic behavior, while 
implosive therapy subjects now showed the 
least amount of change. The author con- 
cluded, then, that the counterconditioning 
model was well supported and that the previ- 
ous success of some of the implosive studies 
her due to confounding or extraimplo- 
such as reality testing and 
pist. 


were eit 
sive components, 
supportive aspects of the thera 


a. The comparative effectiveness of 
ation and implosive therapy in 
e phobia. Unpublished doc- 
rsity of Missouri, 1967. 


5 W. L. Mealie: 
systematic desensit 
the elimination of sna 
toral dissertation, Unive 
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Using a similar design, nearly identical re- 
sults were reported by Willis (|1968]," Willis 
& Edwards, 1969) supporting the superiority 
of desensitization treatment. However, this 
study is marred by the fact that desensitiza- 
tion and implosive therapy groups were 
equated not on an actual behavior test, but on 
the basis of their Fear Survey Schedule scores. 
In view of a recent study by Bernstein (in 
press) who found that 96% of subjects clas- 
sified as “phobic” on the basis of their Fear 
Survey Schedule score picked up and handled 
the phobic object on a behavioral test, it is 
conceivable that the groups in Willis’ investi- 
gation differed in actual avoidance behavior 
before treatment began. 

Barrett ([1968], 1969) also concluded 
that systematic desensitization was the pre- 
ferred choice of treatment, Even though initial 
findings showed no significant differences be- 
tween implosion and desensitization on any 
of the dependent measures used, further anal- 
ysis of the data revealed subtle differences 
between the groups. Barrett noted that al- 
though the means of both groups were identi- 
cal, the variances were not. Desensitization 
subjects, for example, showed a consistent 
and continuous reduction in variance from 
preassessment to postassessment to follow-up. 
Implosion subjects, on the other hand, re- 
vealed a significantly greater variance change 
than both systematic desensitization and con- 
trols, showing a significant reduction in vari- 
ance from pre- to postassessment, but an in- 
crease from postassessment to follow-up. Not- 
ing the arguments of Bergin (1963, 1966, who 
emphasized the possibility of “deterioration 
effects" from therapy which may be hidden 
when only group means are analyzed), Bar- 
rett showed that some implosive therapy sub- 
jects improved, but others got worse, whereas 


desensitization subjects were considerably less 
variable. 

SUR. W. Willis. А study of the comparative effec- 
tiveness of systematic desensitization and implosi 
therapy. Unpublished doctoral dissertation, Unive 
sity of Tennessee, 1968. 

TC. L. Barrett. Systematic desensitization therapy 
(SDT) versus implosive therapy (IT): А compara- 
tive study of the efficiency of two behavior therapies 
in reducing snake phobic behavior in otherwise nor- 


mal adults, Unpublished doctoral dissertation, Uni- 
versity of Louisville, 1968. 
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Six comparative studies have reported es- 
sentially no significant differences between ex- 
perimental treatments in producing behavioral 
change (Borkovec;> Carek;" Horowitz, 1970; 
Jacobson;'" Kirts;'' McGlynn ™). However, 
the investigations by Kirts (who used a con- 
ditioned galvanic skin response as the de- 
pendent variable in comparing “Wolpe” and 
“Stampfl” groups) and Horowitz (who manip- 
ulated treatment conditions through different 
hypnotic suggestions) must be considered 
analogue studies and, therefore, generalization 
to actual desensitization and implosion treat- 
ment may be especially difficult. It is impor- 
tant to note, however, that Horowitz's im- 
plosion subjects, while equal to desensitization 
(relaxation) subjects at posttreatment assess- 
ment, were no longer significantly different 
than controls at a follow-up. Е 

The study by Jacobson reported changes 10 
pulse rates for implosive therapy and desen- 
sitization subjects in the predicted directions, ` 
and the author concluded that both treat- 
ments were supported, although it is difficult 
to justify this interpretation in view of the 
fact that there were no significant differences 
on verbal or motor measures. It does indicate 
the need, however, to use all three types 0 
measures in assessing anxiety during treat- 
ment and at pre-, post-, and follow-up assess- 
ment (a suggestion previously emphasized y 
several researchers, e.g., Craighead, in press; 
Lang, 1969; Mathews, 1971). 


ST. D. Borkovec. The comparative effectiveness af 
systematic desensitization and implosion therapy 29 
the effect ої expectancy manipulation on the zm d 
tion of fear. Unpublished doctoral dissertation, 
versity of Illinois, 1970. m 

R. G. Carek. A comparison of two behavio 
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In contrast to many of the studies reported 
above which have oíten been marred by 
various methodological deficiencies, the study 
by Borkovec (see Footnote 7; 1972) repre- 
Sents a sophisticated and well-controlled de- 
sign. On the basis of a stringent screening 
Process, 50 undergraduate females were ran- 
domly assigned from stratified blocks based 
on their approach scores to one of eight treat- 
ments. The treatment conditions consisted of 
four therapy groups (systematic desensitiza- 
tion, implosive therapy, avoidance response 
placebo, and a no-therapy control) factori- 
ally crossed with two expectancy conditions 
(positive or no expectancy). Of particular in- 
terest was the "avoidance-placebo" control 
Which tested whether subjects who were in- 
Structed to actually make avoidance responses 
Would, in fact, show increases in their phobic 
behaviors since both desensitization and im- 
plosive therapy models assert that avoidance 
behaviors reinforced by anxiety reduction are 
responsible for the maintenance of conditioned 
anxiety, 

All experimental subjects were seen for 
four 1-hour sessions at 1-week intervals. The 
results indicated that both desensitization 
and implosion (regardless of expectancy con- 
dition ) produced significantly lower physio- 
gical arousal (pulse rates) and tended to 
Produce decreases in self-reported fear, al- 

Ough the two treatments did not signifi- 
cantly differ from each other. The fact, how- 
ever, that the avoidance response group also 
Showed nearly equal improvement raises a 
Possible problem for both systematic desensi- 
veation and implosive therapy theory. Borko- 
(T. (1972) adopted an habituation model 
ew & Mathews, 1968) to explain these 
of 55, Suggesting that the important aspect 
loneq аРУ may be the repetition of condi- 

Ee Imuli in the absence of any primary 
comm e Consequences (the OnE component 
Sly, Оп to systematic desensitization, implo- 
erapy, and avoidance control 


оць) the 
Or А : 
ing  S'eater interest in the study is the find- 
avio а strong expectancy effect on overt be- 
D yes E РЕЯ 
Bree Measures of fear and, to а lesser de- 
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decreases in physiological arousal, they did 
not differ from no-treatment controls on the 
avoidance test if they did not have the exe 
pectancy that they would, in fact, change. 4 

Borkovec concluded that traditional learn- 
ing mechanisms (e.g., counterconditioning or 
extinction) may explain physiological changes 
in therapy, but it is the cognitive factors 
(e.g, expectancy or demand characteristics) 
that produce subjective and overt motor dif- 
ferences. These findings are consistent with 
those of Dee '" who found that positive in- 
structions preceding implosive therapy had a 
weak but facilitating effect in reducing fear. 
The results, however, stand in contrast to the 
findings of Layne '* who found no significant 
expectancy effect in a study in which implo- 
sion, listening to music, and no treatment pro- 
duced the same amount of behavioral change. 


Flooding 


The term “flooding” was coined by Polin in 
1959, although there were many previous sug- 
gestions that extinction could be successfully 
accomplished by arranging intense emotional 
reactions to take place in the presence of 
phobic stimuli (Holland & Skinner, 1961; 
Masserman, 1943; Solomon et al., 1953; Solo- 
mon & Wynne, 1954). The first implementa- 
tion of the technique with human subjects 
was reported by Malleson (1959) who em- 
ployed a hierarchy similar to that used in 
desensitization, although instead of encourag- 
ing relaxation, Malleson instructed the client 
to feel more írightened as he proceeded 
through the items. "The results showed that 
the client, after an initial increase in distress, 
showed rapid recovery and a complete cure. 
Essentially, the same promising results were 
reported in another early study by Boulou- 
gouris and Marks (1962) who found complete 
cures in a number of different phobic patients 
after flooding sessions. 

Wolpin and Raines (1966) treated six 
snake-phobic women for a maximum of five 


13C, K. Dee. Instructions and the extinction of a 
learned fear in the context of taped implosive ther- 
apy. Unpublished doctoral dissertation, University of 
Iowa, 1970. 

11C. C. Layne. The effects of suggestion in im- 
plosive therapy for fear of rats. Unpublished doc- 
toral dissertation, Southern Illinois University, 1970. 
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sessions using one of three experimental con- 
ditions: (a) visualization of a 20-item hier- 

 "archy with no training or instructions of any 
kind regarding relaxation; (5) completion of 
a 20-step hierarchy with muscles deliberately 
tense; and (c) presentation of the most in- 
tense scenes from the top of the hierarchy 
with no training or instructions in regard to 
either relaxation or tension. After four treat- 
ment sessions, all subjects were able to handle 
and fondle the originally feared snake and at 
a follow-up, 3 to 5 weeks later, five of the 
subjects were still able to hold the phobic 
object. Although the investigators concluded 
that some form of desensitization or extinc- 
tion was probably the major factor in therapy, 
they readily conceded that just as important 
a factor may be the strong fulfillment-of- 
demand characteristics. The most serious de- 
ficiency of this study is, obviously, the fact 
that few conclusions can be drawn from an 
experiment having two subjects in each treat- 
ment condition. Additional limitations stem 
from the failure to ensure equal length of 
treatment, and the absence of an attention- 
placebo or no-treatment control. 

Rachman (1966) used flooding in the treat- 
ment of three subjects with a demonstrated 
aversion to spiders, comparing them to three 
subjects who had previously undergone de- 
sensitization treatment and three no-treatment 
controls. Flooding subjects were instructed 
to imagine a given scene for 2 minutes at a 
time, while the experimenter attempted to 
"vivify" the experience so that maximum 
anxiety would be elicited. Treatment lasted 
for 10 sessions, at 2-week intervals, with each 
session averaging 10 scene presentations. 

Results indicated that flooding and control 
subjects showed no improvement on either an 
avoidance test or a fear thermometer, whereas 
matched desensitization subjects had mark- 
edly improved. The use of such a short ex- 
posure (2 minutes) must be questioned, how- 
ever, particularly in view of the fact that 
most implosive studies have used much longer 
periods for scene presentations. In fact, БНБ 
of the crucial factors in implosion and flood- 
ing may be the exposure time of visualization. 
Since so few subjects were seen, and since 
flooding, control, and desensitization subjects 
Were seen at different times, caution should 
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be taken in drawing lirm conclusions from 
this investigation. 

A more recent study (Miller & Levis, 1971) 
addressed itself specifically to the variable of 
exposure and is relevant here even though 
the treatment was termed “implosive ther- 
apy." In this investigation, 40 high school 
snake-phobic girls were assigned to treatment 
in either a 0-, 15-, 30-, or 45-minute Os 
posure condition. The group receiving a 15- 
minute exposure displayed the most fears as 
indexed by contact and distant measures with 
no differences shown between the other three 
conditions on any of the measures. The 
authors note, however, that a significant num- 
of subjects in each group showed improve 
ment on the approach test. Miller and Levis 
(1971) interpret their findings in the follow- 
ing way: 


[Subjects given a short exposure, O minutes! " 
capable of defending against complete sensory ink 
of the stimuli associated with the phobic t 
ulus (PTS) and/or against the i 
additional fear eliciting thoughts or imag У Р 
to the PTS. With more exposure, these defenses ДЕ, 
well break down resulting in an increase in the lev 
of fear. Termination of PTS at this point (15 min- 
utes) would be expected to prevent extinction ant 
result in the conservation of a high level of fear 
However with continued exposure, these new cues 
should undergo an extinction decrement [ p. 201. 


В ser and 
Although the speculations of Miller 4 
); with 


others (Rachman, 1969; Staub, | 
pears that the actual exposure times 
study are very much different than 


For example, 


previous investigations. rovemen 


man's (1966) failure to show imp 5 0 
from flooding has been explained in MM 
a very short exposure time (2 minutes), М 
Wolpin and Raines (1966) findings 
been interpreted as successful because М jous» 
“long” exposure (10 minutes). It !5 aS ЕШ 
however, that the latter's “long” an 
time is much shorter than the duration in 
Miller and Levis’ “short” exposure efor: 
utes). The results of each study. d of 
cannot be theoretically explained in te á 
relative (short versus long) exposures atio” 
the actual duration of stimulus e e, 
varies, in fact, from one study (0 ай еб. 
is important, then, if such important 
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Cal issues are to be resolved, that future re- 
Search use consistent. procedures. 

An important series of investigations, briefly 
mentioned here, are those of Morrie Baum. 
Using primarily subhuman subjects, Baum 
has contributed a number of important em- 
Pirical studies demonstrating the success of 
flooding (ог "response prevention," the 
procedure has been termed) in the elimination 
Of avoidance behavior (cf. Baum, 1970, for a 
Teview and discussion of this research). Al- 
though generalization to human subjects may 
at times be difficult from such animal studies, 
the parsimony of Baum is refreshing and it 
Seems quite likely that laboratory analogues 
such as these will shed considerable light on 
Similar procedures with humans. A few recent 
Investigations (e.g, Heath;'* Hughes;'* Pop- 
реп!) have already attempted. comparative 
Studies between flooding and desensitization 


With subhumans that have been quite sug- 
Bestive, 


Flooding versus Systematic Desensitization 


Five compa 
Sensitiz: 


ative studies of systematic de- 
ation and flooding have been reported, 
indicating in general, equivocal and contra- 
dictory findings, and offering little conclusive 
Evidence for the superiority of either proce- 
dure, The first investigation was conducted 
РУ Strahley '* who found a counterphobic 
Teatment (flooding) to be significantly more 

€ctive than desensitization in reducing fear 
Оп the Fear Survey Schedule and an avoid- 
ance test, although there were no significant 
differences on two additional measures. In 
IW of Several problems in design (absence 
°F appropriate no-treatment and “attention- 
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hibit; H. Heath, A comparison of reciprocal in- 

Condit, and flooding in dec ng the strength of a 

isser ed avoidance response. Unpublished doctoral 
зң Os Temple University, 1968. 


tions o, Н. Hughes, An investigation of the assump- 


tion qy; MPlosive therapy and systematic desensiti 
lisse an animal analogue. Unpublished doctoral 
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on, University of Georgia, 1970. 


Ts Poppen. Counterconditioning of condi- 
ression, Unpublished doctoral dissertation, 
sity, 1968, 

«ру, thley, Systematic desensitization and 
s. "€ treatment of an irrational fear oi 


“Published doctoral dissertation, University 
essee, 1965, 


placebo” controls, and procedural inconsist- 
encies both between and within treatment 
conditions), caution must be taken in draw- 
ing any strong inferences. Strahley, in fact, 
concluded that the success of flooding was 
probably more a function of extratherapy 
variables than of an intrinsic superiority of 
the technique. 

In an analogue investigation, Brock 19 
found no significant differences between a 
flooding group and a desensitization treatment 
in the extinction of avoidance behavior. How- 
ever, lack of no-treatment controls and un- 
usual treatment procedures make the study 
difficult to generalize to the clinical use of 
desensitization and flooding. 

De Moor (1970) reported that both de- 
sensitization and flooding produced significant 
reductions in fear, the data also suggesting 
that desensitization was more effective since 
flooded subjects returned to their pretest leve] 
on follow-up testing. The study, however, 
can be criticized for several procedural errors, 
De Moor stated that desensitization subjects 
received a maximum of 10 minutes of relaxa- 
tion training (method unspecified) in each 
session. Other studies (e.g, Paul, 1966, 
1969d) have reported an average of 25 min- 
utes as the minimum time necessary to pro- 
duce complete muscular relaxation. In addi- 
tion, the fact that systematic desensitization 
subjects spent up to half their 20-minute ses- 
sions becoming relaxed, means that only 10 
minutes per session were left for desensitiza- 
tion proper to be conducted. Flooding sub- 
jects, on the other hand, received 20 minutes 
of treatment and even though this was twice 
the amount of "treatment" that desensitiza- 
tion subjects received, exposure time may 
have still been too short, in view of recent 
studies (e.g., Kotila, see Footnote 3: Miller 
& Levis, 1971). 

Calef and MacLean (1970), after treating 
“speech-anxious” college students, reported 
that both flooding and desensitization pro- 
duced significantly more improvement than 


"TL. D. Brock, The efficacy of various extinction 


procedures on a conditioned avoidance response in 


humans: An experimental analogue. Unpublished 
doctoral dissertation, Southern Illinois University, 
1967. 
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controls on the Taylor Manifest Anxiety Scale 
and the Personal Report of Confidence as a 
Speaker. Although the authors concluded that 
reaction inhibition (flooding) was the more 
efficient technique (since it was a simpler pro- 
cedure), deficiencies in design, measures, and 
statistical analysis make the study of little 
value (Blanchard, 1971). For example, the 
use of the Personal Report of Confidence as a 
Speaker as an adequate dependent variable of 
change must be seriously questioned. Blan- 
chard (1971) has pointed out that such self- 
reports are of value only if they are accom- 
panied by actual behavior measures. This can 
be seen in a study by Karst and Trexler 
(1970) who found speech-anxious subjects 
significantly improved on verbal measures, in- 
cluding the Personal Report of Confidence as 
a Speaker, but did not differ from controls on 
behavioral assessment. Other problems in- 
clude the complete confounding of treatment 
by group and of treatment by therapist (since 
only one therapist was used) and the inclu- 
sion of both high- and low-fearful subjects in 
the same treatment group. Finally, the sta- 
tistical analysis of Calef and MacLean pre- 
sents so many gross errors, that the reported 
conclusions are completely unsupported by 
the data (Blanchard, 1971). 

The most recent comparative study (Bou- 
lougouris, Marks, & Marset, 1971) offers the 
only clear evidence that flooding might be 
superior to desensitization (on at least some 
behavioral measures). Sixteen phobic patients 
were seen in a crossover design, in which they 
received six sessions of flooding followed by 
six sessions of desensitization, or received the 
same two treatments in the reverse order. Re- 
sults showed that both experimental condi- 
tions produced Significant. improvement on 
clinical ratings, although flooding subjects 
were significantly better on some of the items. 
Physiological assessment yielded similar find- 
ings and on no measure w 
superior to flooding, 
typical desensitization 
results, of course, diffic 
practice. In addition, 
that severa] patients ir 
felt that they were “с 
could face the 
not shared by 


as desensitization 
The departures from 
and flooding make the 
ult to relate to clinical 
it is interesting to note 
1 the flooding procedure 
hallenged" to prove they 
phobic stimulus—a demand 
the desensitization subjects, 
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Discussion 

It would appear that many of the empirical 
reports of implosion and flooding have re- 
sulted in contradictory and seemingly para- 
doxical findings. For example, the results of 
Rachman (1966) whose subjects showed no 
improvement from flooding, stand in marked 
contrast to the positive outcomes of Wolpin 
and Raines (1966). Wilson (1967) noted that 
Wolpin and Raines’ subjects were asked in 
their imagination to rehearse accomplishment 
of the final criterion of fear reduction, while 
Rachman’s subjects were required to imagine 
scenes so horrible that they might only occur 
in nightmares. Wilson concluded that the ef- 
fects of flooding (or implosion) depend on 
the type of responses which are rehearsed in 
therapy. Staub (1968), however, pointed toa 
few studies reported by Hogan and Kirchner 
(1967, 1968) and noted that the latter E 
vestigators achieved considerable Success anc 
still had subjects imagine extremely ripis 
ing scenes which were irrelevant to e d 
criterion responses. The important me к 
Staub suggested, which determines the efi ы 
tiveness of implosion and flooding, is the du 
tion of continuous exposure to the fear-evo* 
ing stimuli. : 

In a later article, Rachman (1969) P 
addressed himself to the question of exposure 
He noted that premature termination © 
treatment session may result in an exacer 
tion rather than a reduction in wor and 
possibility was raised by Eysenck (Ra 
Wolpe (1958, 1969) and can also be us um 
from the maximal habituation theory sd 
(Lader & Mathews, 1968; Lader d cord- 
1966). Rachman (1969) noted that, 4€ pro" 
ing to this latter theory, desensitization , 
ceeds more effectively when the patient © 
a low level of activation. If anxiety-evo. Я 
stimuli are introduced when the patien in 
a very high level of arousal, an t пріо" 
the magnitude of anxiety may parse 
sion or flooding, then, may be exP ^ arous? 
assuming that the subject’s level О й 
is eventually reduced (as а caan {б 
the prolonged and intense stimulatlo y to th 
point where he can habituate t ; «the 
anxiety-evoking stimuli. In athen a habitu- 
subject is brought to a point of ral 
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ation by first exhausting his capacity for emo- 
tional arousal | Rachman, 1969, p. 299]." 

The recent report by Borkovec (1972) also 
Suggested that an habituation model might 
eXplain the apparently paradoxical finding 
that an avoidance-placebo group (which actu- 
ally rehearsed avoidance responses to phobic 
stimuli) showed considerable improvement in 
fear reduction, He noted, however, that while 
learning mechanisms were able to account for 
Physiological changes (in implosion and de- 
Sensitization), improvement in subjective and 
Overt behaviors could be best explained in 
terms of Cognitive factors, particularly ex- 
Pectancy, 

Several animal studies have shown more 
Consistent success with flooding procedures 
than investigations with human subjects have, 
although several problems still remain. Baum 
(1970) has pointed out, for example, that the 
two-factor theory cannot explain the fact that 
animals still show overt signs of fear even 
though the avoidance response is no longer 
Present, Moreover, Wilson and Davison 
(1971), in a comprehensive review of the 
animal literature, concluded that the counter- 
Conditioning Procedure was more effective in 
reducing conditioned fear than the orthodox 
classica] extinction procedures. Thus it ap- 
Pears that even in the subhuman research, 
looding procedures have resulted in equivocal 
Outcomes, 

Combining the findings, then, of both hu- 
an and Subhuman research, it seems possible 
9 propose that prolonged exposure (if re- 
Peated long enough) may lower physiological 
arousal to a degree sufficient for subjects to 
Make Approach responses (covert or real). In 
"man Subjects, cognitive factors may facili- 
late this Process by creating strong demands 

Perform or by establishing positive ex- 
or oae for change, resulting ina EE 
Anxiet avoidance behavior and subjective 
vated j^ F urther, subjects may then be 90" 
et 0 Teinterpret their. physiological te 
Заде (as Schachter & Singer, 1962, have 
With t *d) so that they are more consonant 
. © Subjects overt behavior. 


Menta "parent. from this review that experi- 
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"E n Investigations of implosive therapy 
2 O0 


* dng have failed to demonstrate con- 


vincingly the effectiveness of these procedures 
in the elimination of phobic and avoidance 
behaviors, Although a few studies have re- 
ported positive findings (the majority con- 
ducted by staunch advocates of implosion), 
most of these investigations are so badly 
marred by methodological deficiencies, mis- 
interpretations of data, invalid assumptions, 
and lack of proper controls 
sible to conclude any cause-effect relation- 
ships between treatment and outcome. 

An additional difficulty with the majority 
of the reports is the fact that most of the 
studies have employed analogue designs in 
which the target behaviors have been fear 
and avoidance of snakes, insects, or rats. The 
problems of generalization from analogue to 
clinical populations have been previously dis- 
cussed by several researchers (e.g., Cooper, 
Furst, & Budger, 1969: Lader, 1967; Lang, 
1969) and the same criticisms are applicable 
to much of the implosion literature, Of par- 
ticular importance is the finding that the 
verbal reports of analogue fears are often 
poorly correlated with actual behavioral mea- 
sures (e.g., Davison, 1968: Geer, 1965; Lang, 
1968; Lang, Lazovik, & Reynolds, 1965). 
Thus, studies that have used only verbal mea- 
sures to select subjects are especially difficult 
to interpret, Moreover, Bernstein (in press) 
has reported two studies showing that even 
performance on actual behavioral avoidance 
tests may be strongly influenced by expect- 
ancy, the assessment setting, and the demand 
characteristics of the testing situation, 

It would seem, therefore, that very subtle 
factors may have greatly influenced many of 
the investigations reviewed here, and the im. 
portant implications of Bernstein's findings, 
like the suggestions of Borkovec (1972), need 
to be considered fully and systematically ex. 
amined in future research. The role of cogni- 
tive processes in systematic desensitization 
has recently attracted great interest (OT 
Davison & Valins, 1968, 1970; Gaupp, Stern, 
& Galbraith, 1972: Wilkins, 1971) and it 
appears likely that similar factors may, like- 
wise, account for much of the variance in 
implosive and flooding procedures, It remains, 
however, an empirical question that has so far 
(with few exceptions) not been tested, 


that it is impos- 
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CONCLUSIONS 


1. The literature in implosive therapy and 
flooding has not only been less extensive and 
less sophisticated than research in desensiti- 
zation, it has also shown the treatments to be 
far less effective. 

2. The comparative studies of implosion 
and flooding with systematic desensitization 
have yielded similarly equivocal results, with 
only one study suggesting the superiority of 
flooding procedures. 

3. In view of these findings, it seems rea- 
sonable to posit the view that in many of the 
instances in which implosion and flooding 
have proven successíul, it has been the de- 
mand characteristics of the situation, the ex- 
pectancy of the subjects, or other extra- 
therapy variables that have contributed the 
major portion of the outcome variance. 

4. The theoretical basis of implosion also 
seems, at present, to be unsupported. Cer- 
tainly the dynamic speculations of Stamp 
and Levis seem unwarranted since the tech- 
nique, when it has worked, has more often 
than not omitted such psychoanalytic aspects 
from treatment. 

5. The few studies that have shown posi- 
tive treatment effects indicate the need for 
more systematic and well-controlled future 
research, particularly in the investigation of 
cognitive factors in therapy. 

6. Finally, even if the effectiveness of im- 
plosion were proven, there would still be 
realistic and ethical considerations to be made. 
It is quite likely that very few clinicians 
would find implosion “palatable” and would, 
therefore, be unable or unwilling to utilize 
such a technique. One must seriously question 
the desirability of having a client undergo 
such stress and trauma in the name of treat- 
ment, when systematic desensitization is at 
least as effective a procedure. 

These factors may very well account for 
the paucity of research in the area. Neverthe- 
less, they remain academic until the more 
basic empirical validation and theoretical sup- 
port are demonstrated. It appears, then, that 
the early questions of Stampfl and Levis 
(1967a) are still unresolved and their cau- 
tions are still prominent, namely, that “fur- 
ther research on the efficacy of the technique, 
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theory, and interaction of the two is still 
needed before any definite claims | for implo- 
sive therapy | can be made |p. 502 |. 


REF ENC 
Principles of behavior modification. 
Holt, Rinehart & Winston, 1969. 
Systematic desensitization versus 1M- 
Journal of Abnormal Psychology 


BANDURA, : 
New York 

Barrett, C. L 
plosive therap; 
1969, 74, 587-592. 

Baum, M. Extinction of avoidance 
through response prevention (flooding). Psycho- 
logical Bulletin, 1970, 74, 276-284. ы 

Baum, М., & Poser, E. G. Comparison of flooding 
procedures in animals and man. Behaviour Re- 
search and Therapy, 1971. 9, 249-254. E 

BERGIN, A. E, The effects of psychotherapy: Nega- 
tive results revisited. Journal of Counseling Psy- 
chology, 1963, 10, 244-250. J 

BERGIN, А. E. Some implications of psychotherapy 
research for therapeutic practice. Journal of Ab- 
normal Psychology, 1966, 71, 46. 3 

BERNSTEIN, D. A. Behavioral fear assessment: So 
iety or artifact. In H. Adams & P. Unikel (Eds 
Issues and trends in behavior therap. pringtield, 
Illinois: Charles C Thomas, 1973, in pre | Ў 

Вілхснлко, E, B. А comparison of re iprocal in- 
hibition and reactive inhibition therapies in BE 
treatment of speech anxiety: А methodologica 
critique. Behavior Therapy, 1971, 2, 103-106. 

Borkovec, T. D. Effec incy on the out- 
come of systematic des ion and implosive 
treatments for analogue anxiety, Behavior Ther- 
apy, 1972, 3, 29-40. 

BOULOUGOURIS, J. C. & Marks, I. M. 
(flooding): А new treatment for phobias. 
Medical Journal, 1962, 2, 721-723. 

BOULOUGOURIS, J. C. MARKS, I. M, & Mars 
Superiority of flooding (implosion) to desens! js 
tion for reducing pathological fear. Behaviour 
search апа Therapy, 1971, 9, 7-16. 

Cater, R, A, & MacLean, С. D. А compar ther- 
reciprocal inhibition and reactive nae, vior 
apies in the treatment of speech anxiety. Beha 
Therapy, 1970, 1, 51-58. patel 

Cooper, A., Furst, J. B, & Buperr, W. Н. А Pl of 
commentary on the usefulness of studying Гай СЕ 
snakes. Jourmal of Abnormal Psychology, T1905 
74, 413-414. € 

Craicueap, W. E. The role of muscular rel Ys 
in systematic desensitization. In R. Rubin KE rk 
Advances in behavior therapy. Vol. 5. New Yo 
Academic Press, 1973, in p 
Avisox, G. C. Systematic 

A Mec E process. Journal of Abn 
Psychology, 1968, 73, 91-99. . 

Davison, G. C., & Улыхз, 5. On self-proe PP 
drug-produced relaxation. Behaviour Researe 
Therapy, 1968, 6, 401—402. 

Davisox, G. С VALINS, S. 
critique regarding scli-produced 


respondinz 


of exp 


Im plosi on 
British 


Р. 


ison 0! 


as 


sensitization 
oun ormal 


1 
luced aní 


h ane 


ines 
A reply to Wolpe 
and drug-pro- 


duced relaxation. Behaviour Research and. Ther- 
apy, 1970, 8, 107-108. 

Dr Moon, W. Systematic desensitization versus pro- 
longed high intensity stimulation (flooding). 
Jounal of Behavior Therapy and Experimental 
Psychiatry, 1970, 1, 45-52. 
Еүѕехск, H, J. A theory of the 
anxiety/fear responses. Behaviour 
Therapy, 1968, 6, 309-322. 

Елио, A. Е. Treatment components in implosive 
therapy, Journal of Abnormal Psychology, 1970, 
76, 211-219, 

Gaupp, L, A, STERN, R. 
False heart-rate f 


incubation of 
Research and 


M. К Смвкати, G. G. 
dback and recipro inhibition 
by aversion relief in the treatment of avoid- 
„ance behavior. Behavior Therapy, 1972, 3, 7-20. 
Grrr, J. Н. The development of a scale to measure 

fear. Behaviour Research and Therapy, 1965, 3. 


‚ К. J., & Ractuumay, S. An experimental in- 
Vestigation of the implosion technique. Behaviour 
Research and Therapy, 1970, 8, 21-28. 

OGAN, R. А. Implosive therapy in the short term 
treatment of psychotics. Psychotherapy: Theory, 


Therapy considerations. Behaviour Re- 
Search and Therapy, 1969, 7. 177-184. 

Hog, N, R. A, & KIRCHNER, J. H. A preliminary re- 
Dort of the extinction of learned fears via short 
term implosive therapy. Journal of Abnormal Psy- 
chology, 1967, 72, 106-111. 

OGAN, R. A, & KIRCHNER, J. H. Implosive, eclectic 
verbal and bibliotherapy in the treatment of fears 
of snakes Behaviour Research and Therapy, 1968, 
6, 167-171, 
OLLAND, J. G., & Skinner, В. F. The analysis of 
behavior, New York: McGraw-Hill, 1961. 
OROWITZ, S, L, Strategies within hypnosis for re- 
ducing Phobic behavior. Journal of Abnormal Psy- 
chology, 1970, 75, 104-112. 

Karst, T. O, & Тахо, L. D. Initial study using 
Sed-role and rational-emotive therapy in treating 
Public-speaking anxiety. Journal of Consulting 


me Clinical Psychology, 1970, 34, 360-366. 
IRCHNER, 7. By 
Variable 

apy; 

0 


& Hocax, R. A. The therapist 
le in the implosion of phobias. Psychother- 
Theory, Research, and Practice, 1966, 3, 102- 


H. Palmar skin conductance measures in 
‚ > and phobic states. Journal of Psychoso- 
esearch, 1967, 16, 271-281. 


moda H., & Marnews, A. M. A physiological 
5 Phobic anxiety and desensitization. Be- 
е esearch and Therapy, 1968, 6, 411-421. 
Sedatiy E Н. & Wise, L. Physiological measures, 
ford Pris and morbid anxiety. London: Ox- 
ANG, p Diversity Press, 1966. 


^x Fear reduction and fear behavior: Prob- 
n 


treating a construct, In J. M. Shlien 


IMPLOSIVE THERAPY AND FLOODING 333 


(Ed.), Research in psychotherapy. Washington, 
D. C.: American Psychological Association, 1968. 

Laxc, P. J. The mechanics of desensitization and the 
laboratory study of human fear, In C. M. Franks 
(Ed.), Behavior therapy: Appraisal and status. 
New York: McGraw-Hill, 1969. 

Lane, P. J., Lazovik, А. D, & Rrvxorps, D. Je 
Desensitization, suggestibility, and pseudotherapy. 
Journal of Abnormal Psychology, 1965, 70, 395- 
402. 

Levis, D. J. Implosive therapy 
human analogue, the strategy, and the technique, 
In S. G. Armitage (Ed.), Behavior modification 
techniques in the treatment of emotional 
orders. Battle Creek, Mich.: V. 
1967. 

Levis, D. J., & Carrera, R. N. Effects of 10 hours 
of implosive therapy in the treatment of out- 
patients: A preliminary report. Journal of Ab- 
normal Psychology, 1967, 72, 504-508 . 

Matteson, N. Panic and phobia: A possible method 
of treatment. Lancet, 1959, 1, 225-227, 

MAsserMAN, J. H. Behavior and neur 
University of Chicago Press, 1943, 

MATHEV 


. Part П: The sub- 


dis- 
A. Publication, 


sis. Chicago: 


l approaches to 
the invest sensitization and related 
procedures. Psychological Bulletin, 1971. 76, 73-91. 

Mealea, W. La & Nawas, M. M. The comparative 
effecti ess of systematic desensitization and im- 
plosive therapy in the treatment of snake phobia. 
Journal of Behavior Therapy and Experimental 
Psychiatry, 1971, 2, 85-94, 

MILLER, B. V., & Levis, D. J. The effects of varying 
short visual exposure times to a phobic test stim- 
ulus on subsequent avoidance behavior, Behaviour 
Research and Therapy, 1971, 9, 17-21. 

Mr N. E. Learnable drives and rewards, In 
S. S. Stevens (Ed.), Handbook of experimental 
psychology, New York: Wiley, 1951. 

Mowrer, O. H. A imulus-response analysis of 
anxiety and its role as a reinforcing agent. Psy- 
chological Review, 1939, 46, 553-565. 

Mowrer, О. Н. On the dual nature of learning: A 
reinterpretation of “conditioning” and “problem 
solving." Harvard Educational Review, 1947, 17, 
102-148. 

PauL, G. L. 


Insight vs, desensitization in psycho- 
in experiment in anxiety reduction. 
Stanford University Pr 1966. 

PavL, G. L. Behavior modification research: Design 
and tactics. In C. M. Franks (Ed.), Behavior 
therapy: Appraisal and status. New York: Mc- 
Graw-Hill, 1969. (a) 

Paur, С. L. Outcome of systematic desensitization, 
1: Background, procedures, and uncontrolled re- 
ports of individual treatment. In C, M, Franks 
(Ed.), Behavior therapy: Appraisal and status, 
New York: McGraw-Hill, 1969. (b) 

Pavi, G. 1. Outcome of systematic desensitiz: 
П: Controlled investigations of individual 
ment, technique va 
C. M, Fra 
and status. 


ion, 
treat- 
ations, and current status. In 
s (Ed). Behavior therapy: Appraisal 
New York: McGraw-Hill, 1969, (c) 


334 


PauL, G. L. Physiological effects of relaxation train- 
ing and hypnotic suggestion. Journal of Abnormal 
Psychology, 1969, 74, 425—437. (d) 

Potty, A. T. The effect of flooding and physical sup- 
pression as extinction techniques on an anxiety- 
motivated avoidance locomotor response. Journal 
of Psychology, 1959, 47, 253-255. 

Pnocraska, J. O. Symptom and dynamic cues in the 
implosive treatment of test anxiety. Journal of 
Abnormal Psychology, 1971, 77, 133-142. 

RacuMax, S. Studies in desensitization, II: Flooding. 
Behaviour Research and Therapy, 1966, 4, 1-6. 

RacuMAN, 5. Systematic desensitization. Psychologi- 
cal Bulletin, 1967, 67, 93-103. 

RACHMAN, S. Treatment by prolonged exposure to 
high intensity stimulation. Behaviour Research 
and Therapy, 1969, 7, 295-302. 

SCHACHTER, S., & SINGER, J. E. Cognitive, social, and 
physiological determinants of emotional state. Psy- 
chological Review, 1962, 69, 379-399. 

Ѕогомох, R. L, Kamin, L. J, & Wy Ly & 
Traumatic avoidance learning: The outcomes of 
several extinction procedures with dogs. Journal 
of Abnormal and Social Psychology, 1953, 48, 291- 
302. 

SOLOMON, R. Lọ, & Wynne, L. C. Traumatic avoid- 
ance learning: The principles of anxiety conserva- 
tion and partial irreversibility. Psychological Re- 
view, 1954, 61, 353-385. 

SrAMPrL, T. С. Implosive therapy, Part I: The 
theory. In S. G. Armitage (Ed.), Behavior modi- 
fication techniques in the treatment of emotional 
disorders. Battle Creek, Mich.: V. A. Publication, 
1967. 

Sramprt, T. G, & Levis, D. J. Implosive therapy. 
In R. M. Jurjevich (Ed.), Handbook of direct 
and behavior psychotherapies, Englewood Cliffs: 
Prentice-Hall, 1966. 

Stamprt, T. G, & Levis, D. J. Essentials of im- 
plosive therapy: A learning-theory-based psycho- 
dynamic behavioral therapy. Journal of Abnormal 
Psychology, 1967, 72, 496-503. (a) 

STAMPFL, T. G, & Levis, D. J. Phobic patients: 
Treatment with the learning theory approach of 


KENNETH P. MORGANSTERN 


implosive therapy. Voices: The Art and Science 
of Psychotherapy, 1967, 3, 23-27. (b) 

Stamprt, T. G., & Levis, D. J. Implosive therapy—@ 
behavioral therapy? Behaviour Research and 
Therapy, 1968, 6, 31-36. 

Sramprt, T. G., & Levis, D. J. Learning theory: An 
aid to dynamic therapeutic practice. In L. D. 
Eron & R. Callahan (Eds.), The relation of theory 
lo practice in psychotherapy. Chicago: Aldine, 
1969. . 

Straus, E. Duration of stimulus-exposure as determi- 
nant of the efficacy of flooding procedures in the 
elimination of fear. Behaviour Research and Ther- 
apy, 1968, 6, 131-132. 

UNDERWOOD, B. J. Psychological research. 
Appleton-Century-Croits, 1957. 

Watson, J. P., & Marks, I. M. Relevant and irrele- 
vant fear in flooding—A crossover study of phobic 
patients, Behavior Therapy, 1971, 2, 275-293. 

WILKINS, W. Desensitization: Social and cognitive 
factors underlying the effectiveness of Wolpe 5 
procedure, Psychological Bulletin, 1971, 76, 311- 
317. 

Wirus, R. W., & Epwaros, J. A. A study of the 
comparative effectiveness of systematic eg 
tion and implosive therapy. Behaviour Research 
and Therapy, 1969, 7, 387-395. К 

Wirsox, С. D. Efficacy of “flooding” procedures n 
desensitization of fear: A theoretical note. Be- 
haviour Research and Therapy, 1967, 5, 138- 

WILSON, б. T. & Davison, С. C. Process of fear 
reduction in systematic desensitization: Anima 
studies. Psychological Bulletin, 1971, 76, 1-14. 

Worrr, J. Psychotherapy by reciprocal inhibit 
Stanford: Stanford University Press, 1958. 

WorrE, J. The practice of behavior therapy- 
York: Pergamon Press, 1969. Р 

Worpr, J., & Lazarus, А. A. Behavior therapy аа 
niques: A guide lo the treatment of neuros? 
Oxford: Pergamon Press, 1966. 

Worrix, M., & RAINES, J. Visual imagery, 
roles and extinction as possible factors in 
fear and avoidance behavior. Behaviour 
and Therapy, 1966, 4, 25-37. 


(Received May 11, 1972) 


New York: 


ion. 


New 


expected 
reducing 
Research 


Psychological lletin 


1973, Vol. 79, No. 5, 335-340 


ALICE F. HEALY 4 


Criterion shifts in recall are examined 


of criterion. With a new index of the cr 
shifts in recall is found rather to be е 
constant across different retention inte 


became more lax with increasing task 
the subjects 
subjects shift their criteria in Bjork a 
Finally, this ana 
to leave blanks. 


Recent applications of signal detection the- 
Огу to the study of recall have been con- 
cerned with the problem of criterion shifts. 
The existence of such shifts has been the sub- 
Ject of much debate (Banks, 1970; Banks, 
1971; Bernbach, 1971; Murdock, 1966), but 
this debate has been clouded by confusion on 
two basic points. First, not all authors have 
Correctly distinguished between standard sig- 
nal detection analysis (Type I) and Type II 
analysis, Second, most authors, even those of 

* original signal detection studies (e.g., 
»Wets, Tanner, & Birdsall, 1961), have been 
consistent in their use of the term “crite- 
rion.” This paper attempts to clarify the con- 
Cept of criterion and reviews the evidence for 
Criterion shifts in the light of this clarification. 


Type I versus TYPE II ANALYSES 


1 The distinction between Туре I and Type 
Signal detection analyses has been mis- 


takenly equated (Banks, 1970, 1971) with 


the des ы к 

© distinction between the binary decision 
;., Confidence rating procedures. Clarke, 
Irdsall 


t ^, and Tanner (1959) defined Type I 
ysis as that applied when the subject 
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Rockefeller University 


in light of clarifications of the distinc- 


tion between Type I and Type II signal detection analyses and of the concept 


iterion, Murdock’s evidence for criterion 
vidence that subjects hold their criteria 
rvals. However, with the new criterion 


index, criterion shifts are found in intelligibility experiments and in the short- 
term recall experiment by Bjork and Healy. In these studies, subjects’ criteria 


difficulty. A possible strategy used by 


is proposed to explain this finding as well as the finding that the 


nd Healy’s task but not in Murdock’s. 


ysis is applied to recall studies in which the subject is allowed 


evaluates external stimuli provided by the 
experimenter, while Type II analysis is ap- 
plied when the subject evaluates his own 
responses. In either case, the subject’s evalua- 
tion may be a binary decision or a confidence 
rating. In the Type I case of evaluating an 
external stimulus, the subject may use the 
binary decision procedure by responding “sig- 
nal/noise” in a signal detection paradigm or 
“old/new” in a recognition task; alterna- 
tively, he may use the confidence rating pro- 
cedure by giving his level of confidence that 
the signal was present or that a recognition 
test item was an old item. For the Type IT 
situation, in which the subject evaluates a re- 
sponse he has just made on a test trial, he 
may give a binary decision by responding 
"correct/error" or he may use a confidence 
rating to indicate his judgment of the cor- 
rectness of his response. Clearly, confidence 
ratings are simply extensions of binary deci- 
sions; with ratings the subject can give a 
multivalued evaluation of a stimulus or of 
his response, whereas with the binary de- 
cision procedure he is restricted to a two- 
valued judgment. Thus, binary decision yields 
a single point on the receiver-operating-char- 
acteristic (ROC) curve, while a confidence 
rating procedure with » rating levels yields 
п — 1 points. 

The distinction between Type I and Type 
П signal detection analyses is subject to an 
even more subtle confusion, however, than 
that of equating it with the difference between 
the binary decision and confidence rating pro- 
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cedures. Clarke et al. (1959) referred to Type 
I analysis as stimulus conditional and to Туре 
II analysis as response conditional. As men- 
tioned above, this distinction can be based on 
whether the subject evaluates external stimuli 
or his own responses. However, the distinc- 
tion can also be based on how the signal detec- 
tion analysis is done: the ROC curves can be 
stimulus conditional or response conditional 
depending upon how the hit (or false alarm) 
rate is conditionalized. Specifically, the hit 

ws can be the probability of a certain evalua- 
tion by the subject given that the stimulus was 
a signal in a signal detection task or given that 
it was old in a recognition task. This is stimu- 
lus-conditional analysis. Alternatively, the hit 
rate can be the probability of a certain evalua- 
tion by the subject given that his current re- 
sponse was correct. This is response-condi- 
tional analysis, It is often assumed that these 
two bases for distinguishing Type I and Туре 
II analyses, that is, what the subject does and 
how his data are analyzed, are necessarily 
equivalent. However, this assumption is open 
to debate. 

For instance, in the recognition experiment 
by Donaldson and Murdock (1968), the sub- 
ject was asked to decide whether each stimu- 
lus was old or new and to give a confidence 
rating from 1 to 5 for his response. If Type I 
and Type II analyses are held to be defined 
by what the subject does, then this experiment 
must be treated with Type II analysis; but if 
Type I and Type II analyses are held to be 
distinguished by the method of analysis used, 
then this experiment can be handled by either 
analysis. 

As a matter of fact, Donaldson and Mur- 
dock applied Type I analysis. They treated 
their data as a 10-point rating by the subject 
of his confidence that the stimulus was old; 
thus, ап “old-5” response was taken as a 
rating of 10, while a *new-5" response was 
counted as a rating of 1. This enabled them 
to take their hit rate to be the probability that 
the subject emitted rating i (i = 1 ,..., 10) 
given that the stimulus was old, rather than 
the probability that the subject emitted rat- 
ing (= 1, А 
Was correct (i.e., was a hit or a correct rejec- 
tion). 

Since Type II analysis is response condi- 
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tional, rather than stimulus conditional, it 
yields information not about the subjects re- 
sponse generation but only about his response 
evaluation. Thus, a Type lI analysis of a 
recall task does not directly reveal the state 
of the subject's memory for the items but in- 
stead illuminates the evaluation process the 
subject applies to his recall responses. Values 
of d' and shifts in criteria revealed by Type 
II analysis reflect only this evaluation pro- 
cess, not the subject's memory state. 


INDEX OF CRITERION 


Inconsistent use of the term criterion has 
led to problems in determining whether CF 
terion shifts exist in the recall situation. part 
of the difficulty stems from the fact that 1- 
structions to the subject about what type 9 
criterion to maintain vary over different eX- 
periments. The subject may be told explicitly 
to regulate his hit and false alarm rates 
(Egan, Schulman, & Greenberg, 1959), 0r 10 
categorize his responses according to à pos; 
teriori probability estimates (Clarke, 1960), 0" 
to consider a payoff matrix when making his 
responses (Swets et al., 1961). These different 
instructions necessarily define different C" 
terion indices. However, in most experiments, 
the subject is told no more than to be "strict 
or “lax” in making his responses. T 

The most plausible psychological ipaa 
of these instructions seems to involve Re 
proportion of hits to false alarms that = 
subject allows himself to make. If instruc n 
to maintain a strict criterion, the subject yer 
allows himself very few false alarms wl 
to the number of hits he makes, whereas A 
structions to operate with a lax criterion ре 
dicate to the subject that he may allow mi 
false alarms relative to the number of Het 
hits. Most authors who employ a signal 
tection analysis identify the criterion a5 ? 
the cutoff point value on the likelihood T4". 
scale. They characterize the еей ag 
"strict" or "lax" depending on whether ie 
has a high or low value, and for a pro. 
probability of signal beta does indicate 
many false alarms the subject allow 
to make relative to the number of 
However, when the probability of 
changes, beta no longer reflects the 
hits to false alarms. In fact, the cuto 
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сап remain constant while the subject's de- 
cisions become more strict or more lax in 
terms of the number of false alarms he allows 
for each hit that he makes; conversely, with 
à change in probability of signal, the cutoff 
point can change even though the subject is 
not being more strict or more lax, In view 
of this inconsistency, it would seem advan- 
lageous to find a new index of the criterion 
that reflects the ratio of hits to false alarms 
more directly, 

Tf the subject is considered to be keeping his 
Criterion constant as long as his ratio of hits 
to false alarms remains constant, then con- 
Stant criterion means P(I) /P(FA) = А. For 

уре I analysis with a binary decision pro- 
cedure, P(H) is the joint probability of sig- 
nal plus noise occurring and the subject say- 
Ing "yes," so that P(H) = P(SN-Y). Simi- 
larly, P(FA) = P(N-Y) (Swets et al., 1961). 

епсе, constant criterion means 


P(SY-Y)/P(N-Y) = А. 


Since for any two events АЛ and B, 
P(A-B) = P(B|A):P(A), 
Constant criterion also implies that 
IP(Y|SN)/pP(Y N)]: |P(SN)/P(N)] = k. 


Therefore, if P(SN)/P(N) decreases, then 
P(Y|SN)/P(Y|N)—the ratio of the hit rate 
to the false alarm rate, or Z/ R/FAR—must in- 
“ease, given a constant criterion. A change in 
AR/PAR implies that the likelihood ratio cut- 
off Doint, or beta, is also changed. Thus, if the 
Probability of a signal changes during a task, 
‚ ТЄ Subjects cutoff point will also change if he 


15 б TAS "ue" а 
maintaining à constant criterion as defined 


te This prediction that beta should 
is ut With changes in probability of signal 
Stan Dsistent with the prediction made by the 
beta ard calculation of the optimal value of 
руу (без et al, 1961): 8 = У: [Р(№)/ 
Values mh Where V is the factor of costs and 
that ,, Therefore, those authors who contend 
despie Subject should hold beta constant 
the sul Changes in P(SN) are demanding that 
1t, inse E behave in a nonoptimal manner. 
Pty Stead, the subject is expected to hold 
timas, 41) constant, he can perform op- 
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The same argument holds for the case of л 
confidence ratings and Type П analysis (as 
well as for Type I confidence rating and Type 
П binary decision procedures). If the sub- 
ject employs a constant criterion for his 
highest confidence rating, R,, then PERA 
P(E-R,) =k, where C and E represent cor- 
rects and errors, respectively, (For a lower 
rating, Rj, 


PIC? (VR, ду... У) | / 


PIE: (RyRy... VR) | = bj.) 
Thus, as above, 
[P(R,|C)/P(R,|E) | | P(C) /P(E)] = k. 


In a recall task, P(C)/P(E) decreases with 
increasing retention interval; hence PORC 
P(R, E), which is ZR,/FAR,, increases with 
increasing retention interval, given a constant 
criterion. Again a change in HR,/FAR,—Aand 
hence a change in beta,— does not constitute 
a criterion shift. 

What measure, then, reflects criterion 
shifts? For Tvpe I analysis with binary deci- 
sion, a constant criterion means 


P(SN-Y)/P(N-Y) =k. 


Equivalently, 

ГРОЗУ ИРС) |. LP(Y)/P(Y)] = k, 
Since Р(Х|Ү) =1 — P(SN|Y), then 

P(SN|Y) = kG + 1) = 

for a given criterion. Thus, the a posteriori 
measure, P(SV'Y), is an index of the crite- 
rion. As P(SV|Y) changes or remains con- 
stant, so does the subject's criterion. Simi- 
larly, with Type II analysis and. confidence 
ratings, P(C- R,)/P(E-R,) =k for a con- 
stant criterion. Thus, 

| P(CCIR/P(E R,|- |P(R,) /P(R,)] = k, 


and since P(E'R,) =1 — P(CIR,), then 
P(C|R,) =k/(k+1) = &' for a given cri- 
terion. P(C/R,), therefore, indexes criterion 
changes. (Likewise, P(C К,үК, лу... УР) 
is the index for the jth criterion.) 


EVIDENCE FOR CRITERION SHIFTS 


With this criterion index in hand, what 
evidence can be found for criterion shifts in 
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TABLE 1 


CRITERION INDIC FOR POLLACK AND 
DECKER (1958) 


Signal-to-noi-e ratio 
; | 
Criterion index 


—9 db. —12db. | —15 db. 


P(C|Rs) 95 86 90 
P(C| Revs) | 98 81 84 
P(C|R;v--vRj) | .89 ло | M 
P(C|Rov- vR) | 85 67 | .60 
P(C|Rev:--vR) | .88 .60 49 


recall? The classic study on this question is 
Murdock's (1966). On each trial the subjects 
were shown a list of paired associates and were 
tested for recall of one of the items on this 
list. The subjects also gave confidence ratings 
of their responses. Murdock employed Type 
II analysis and found an increase in beta with 
increasing retention interval (serial position) 
for the highest confidence rating, the only 
rating he analyzed. However, he did not find 
evidence for a criterion shift as defined above, 
ior P(C|R,) was constant across retention 
intervals, Therefore, Murdock's finding of an 
increasing beta across retention intervals can 
be viewed as the natural consequence of the 
subjects' attempts to keep their criteria con- 
stant, rather than as an artifact of his experi- 
mental design as Banks (1970) argues. Since 
Murdock's results indicate that subjects 
can maintain a constant criterion over task 
difficulty, it is reasonable to ask if there is 
evidence that subjects ever shift their criteria 
over tasks of varying levels of difficulty. 
Intelligibility experiments provide a para- 
digm analogous to recall tasks in which to 
study criterion shifts. Such experiments, like 
recall, are generally treated with Type II 
analysis since the number of stimulus alterna- 
tives involved is greater than two. The gen- 
eral paradigm requires the subject to identify 
which member of a set of test words was 
presented and to rate his confidence in his 
response. Pollack and Decker (1958) used 
Such a paradigm and varied speech-to-noise 
ratio in order to manipulate task difficulty. 
They observed a constant P(C|R,) across 
different speech-to-noise ratios, which indi- 
cates that their subjects were able to hold 
their highest criteria constant. The indices 
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for the remaining criteria can be calculated 
from the conditional proportions РСК) 
and the proportions P(R;), both of which 
were presented graphically by Pollack and 
Decker, by means of the following equation: 


Р(С| Ri V Ry) 
РСК): POS) + P(C| Re): PO) 
ae P(R 3) + РО) 


The values of these indices are shown in Table 
1. As the table indicates, these lower criterion 
indices did not stay constant across task di " 
ficulty; rather, they seem to decrease with 
decreasing signal-to-noise ratio or increasing 
task difficulty. Pollack апа Decker, then, like 
Murdock, found no evidence for a shift of the 
highest criterion, but they did find shifts 0 
the lower criteria. d 
In a subsequent experiment Decker s 
Pollack (1958) considered the same problem 
but presented words under several e 
conditions (low-pass frequency cutoffs) dew 
than at several speech-to-noise ratios. Dec ES 
and Pollack did not include the figures neces 
sary to calculate the lower criterion indice 
but they did find that P(C|R,) decreased wit У 
decreasing cutoff frequency, or increasing tasa 
difficulty, indicating a shift of the highest 
criterion toward increasing laxness with et 
creasing task difficulty. Decker and Poll 
reasoned that a shift of the highest шепн 
occurred in this experiment with filtered Wo 
but not in the earlier experiment with sig 
speech-to-noise ratios because the qualita 
features and the average intelligibility a E 
various speech-to-noise ratios were not uei 
edly different, while the “sound” of the W A 
at the various filtering levels did contra? 
noticeably. shes 
A later study by Clarke (1960) furnish 
evidence for criterion shifts across diei 
signal-to-noise ratios. Clarke varied the ! the 
ber of stimulus alternatives as well 45 
signal-to-noise ratio. He did 
proportions P(R;) needed to comput : 
dices for the lower criteria, but he -— 0° 
P(C|R,) to be directly related to spe EE 
noise ratio and inversely related to ified 
set size. Again, the highest criterion s 
toward increasing laxness with e 
difficulty. Clarke explained this fin 


by 


pv cut 


task wv 
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hypothesizing that the subject rates a given 
gw item relative to the other items in the same 
test block rather than to all other items heard 
in a session, Indeed, items were blocked by 
Message set size and signal-to-noise ratio in 
his experiment and similarly blocked by task 
difficulty in the two experiments previously 
discussed, 
_ Although criterion shifts seem to be evident 
In intelligibility experiments, the question re- 
mains whether there is evidence for criterion 
shifts in recall experiments. Later papers 
(eg., Bernbach & Bower, 1970) reporting 
replications of Murdock’s finding of a con- 
Stant d' and an increasing beta across reten- 
lion intervals have not provided the data 
necessary to compute the criterion indices. 
Owever, such data were available to us from 
an experiment performed by Bjork and 
Healy at Rockefeller University. This ex- 
Dériment tested short-term recall at various 
retention intervals. On each trial the subject 
Was shown four letters followed by 3, 8, or 18 
digits. As in Murdock’s experiment, retention 
Interval was randomized across trials; how- 
ever, here retention interval was determined 
Y the number of digits intervening between 
Presentation and test rather than by serial 
Dosition, Moreover, on each trial of this ex- 
Petiment, the subject was forced to recall all 
Our letters in their proper order, while Mur- 
Ock tested only one item per trial. As in 
“urdock?s experiment, the subject also as- 
Signed to each item recalled a rating indicat- 
mg how confident he was that it was correct. 
A 3-point scale was employed here; Murdock 
ue а 6-point scale. Each of 12 subjects par- 
wid ded in two sessions of this experiment 
h 72 trials per session. 
Boma 2 shows the criterion indices as com- 
Conf lor the two criteria separating the three 
ence ratings for each of the 12 subjects 
fepe rh of the three retention intervals. A 
19 €d-measures analysis of variance (Winer, 
Consta 'ndicated that the null hypothesis of a 
Cou] nt Criterion across retention intervals 


~p Tejected for each criterion (F = 4.00, 
NET уте 
lem Bjork & A.F. Healy. Intra-item and extra- 


1—05 of acoustic confusion in short-term 

n, Communications in mathematical psy- 
echnical Report series) New York: 
niversity, 1970, 


TABLE 2 


CRITERION INDICES FOR BJORK AND 
Heaty (1970) 


P(C\R») P(C|RwR:) 

Sub- Number of digits Number of digits 

ject intervening intervening 
| 1 £ 9 | B a 8 | 18 
BE NEN | 36 | .65 | .si 
2 | 97 | оа | | 96 | .82 | 87 
3 | .69 | .70 | 1.00 | 65 | 53 | б 
4 | .98 95 | | .97 | .88 | .66 
5 | .92 68 | | «92: | 165. | т 
6 | 96 | 89 | 93 | 8з | .80 
7 | .88 80 | 86 | .70 | .65 
8. | чә. || 288: | 93 | .80 | 5% 
9 | 80 | .60 | .88 | аз | 40 | 30 
10 | .95 | .98 | 1.00 | 94 | 92 | оз 
п | 87 | 4 36 67 | ыз 
12 86 а | S | 58 | ae 


df = 2/22, p < .05, for the higher criterion; 
F = 21.67, df = 2/22, p< :01, for the lower 
criterion). As in the previous experiments 
where criterion shifts were found, the sub- 
ject’s criteria became more lax with increas- 
ing task difficulty. 


Discussion AND CONCLUSIONS 


Why do we find evidence for criterion shifts 
in Bjork and Healy's data but not in Mur- 
dock's? If subjects shift their criteria as 
a result of a tendency to rate a given item 
relative to other items of the same retention 
interval rather than relative to all other items, 
they would be able to do so only if items from 
the different retention intervals were blocked 
or were highly discriminable. In both of these 
recall studies the various retention intervals 
were not blocked but were presented ran- 
domly throughout the session, However, it 
could be argued that retention interval was 
blocked to some extent in the Bjork and Healy 
experiment since on every trial the four letters 
tested, each of which was given a separate con- 
fidence rating by the subject, necessarily came 
from the same retention interval. In addition, 
trials at the different retention intervals were 
probably more discriminable in the Bjork and 
Healy experiment, where retention interval 
depended on number of intervening digits, 
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than in Murdock's experiment, where reten- of Bernbach’s finite-state theory in recall. Psycho- 


tion interval depended on serial position. logical Bulletin, 1971, 76, 151-152. 


Such evidence for criterion shifts does not 
bear solely on experiments which employ con- 149-150. 
fidence ratings. These findings have an im-  Bernpacu, Н. A, & Bower, С. Н. 


Bernpacu, Н. A. Invariance of d* in memory: Re- 
sponse to Banks. Psychological Bulletin, 1971, 76 


Confidence 


portant implication for experiments in which ratings in continuous paired-associate learning. 


: Psychonomic Scie 70, 21, 252-253 
subjects are not forced to respond to every sychonomic Science, 1970, 21, 252-25 


item but instead are free to leave blanks when sponses and confusion matrices in in 


CLARKE, F. R. Confidence ratings, second-choice re- 


telligibility 


uncertain: In these experiments P(C) for tests. Journal of the Acoustical Society of America, 
overt responses can be seen as analogous to |, 1960, = eu A wp, JE 

B LARKE, F. R., BIRDSALL, T. G., & Tanner, W. £4 J* 
P(C|Rn) 30: Шаба change in P(C) for overs Two types of ROC curves and definitions of PA” 
responses would be evidence for a criterion rameters. Journal of the Acoustical Society of 
shift. (See Lockhart, 1969, for a similar argu- America, 1959, 31, 629-630. 


ment.) If such criterion shifts do occur and Decker, L., & Porrack, I. Confidence T 
go undetected, the result could be a distorted message reception for filtered speech. 


picture of memory loss with increasing re- pes 


atings and 


Journal 0, 
the Acoustical Society of America, 1958, 30, 432- 


tention interval. P(C) for all responses might рохлірѕох, W., & Мокроск, B. B. JR- Criterion 
be inflated at the longer intervals due to a change in continuous recognition memory. eed 
325-330. 


more lax criterion there. Thus, although Type of Experimental Psychology, 1968, 76, 


II analysis of recall cannot yield any new in- Eoan, J. P. SCHULMAN, А. I, & GREENBERG) Di y 


А 1 3 
formation about the subject's memory for the decisions and by ratings. Journal of the 
given items in a task, Type II analysis can Society of America, 1959, 31, 768-773. 


Operating characteristics determined by 


A coustical 


illuminate the important evaluation processes LOCKHART, R. S. Retrieval asymmetry and the cri- 


which go on in recall and which may affect terion problem in cued recall. Journal 


the measure of memory loss in such experi- mental Psychology, 1969, 81, 192-19 


of Exper 


B t- 
Munpocx, B. B., Jr. The criterion problem 1n shor 


ments. term memory. Journal of Experimental Psychor 

М ogy, 1966, 72, 317-324. E- 

+For example, W. К. Estes. On the source of  porrack, I, & DECKER, L. Confidence ratings, 216 

acoustic confusions in short-term memory for letter sage reception, and the receiver operating hine 
strings. In, Communications in mathematical psychol- teristic. Journal of the Acoustical Society 

ogy. (Technical report series) New York: Rockefeller America, 1958, 30, 286-292. Pe 
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In the article “Defining and Testing Hypotheses in Multidimensional ContigencY 


Tables” by Juliet Popper Shaffer in the February 1973 issue, the concluding P 
theses of Footnote 3 on page 128 should read as follows: 


aren- 


(Bishop, 1969; Goodman, 1970, 1971a). еї 


In addition, the first sentence under the heading Log-Linear Model on pas 
should read as follows: 


The log-linear model, with properties investigated extensively by Birch (1970), 


Bishop (1969), Goodman (1964a, 1969}, 1970, 1971а), and Haberman 
provides. ... 
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LONG-TERM PSYCHOLOGICAL EFFECTS OF LSD * 


SPEN 


JER A. McWILLIAMS? axb RENÉE J. TUTTLE 


University oj Arizona 


The literature reporting lasting adverse psychological reactions to LSD is re- 
viewed, along with other relevant reports where LSD users were followed. The 


danger of long-lasting psychological damage is low when the drug 
emotionally stable individuals in secure, controlled settings 


is used by 
but persons with 


psychiatric disturbance, unstable personalities, and current crisis situations have 
experienced pathological behavior temporally related to drug ingestion. Ad- 
verse reactions were not reported in well-controlled studies with normal sub- 


jects, indicating the drug 


Although lysergic acid diethylamide (LSD) 
Was discovered to have hallucinogenic powers 
In 1943, drugs with similar makeup and phar- 
Macological effects have been used for many 
Years, especially by American Indians in their 
Te 151005 rituals (Pelner, 1967). Since the syn- 

esis of LSD, these drugs have become a 
Subject of empirical study among behavioral 
Scientists studying the effects of such agents, 

er purported ability to induce schizophrenic 
States, and their use in psychotherapy. During 

¢ 1950s a number of research projects in- 


vestigating the clinical effects of LSD were 
Mdertaken (See Cohen, 1967; Cole & Katz, 


К 64; Freedman, 1968, 1969; Ludwig & Le- 
E 1966; Osmond, 1957; Unger, 1963, for 
views) 

M 
abili 


any laymen became aware of the reported 
ty of LSD to induce altered perceptual 
5 and intense positive personal experi- 
delie» (e.g., Huxley, 1954). This “psyche- 
of Тет ahah (Osmond, 1957) led to use 
Amon, in the general population, particularly 
i mee the young. Although long-term reac- 
from ү LSD had occasionally been reported 
~ Medical administrations (Cohen, 1960), 


ra is expressed to Helen and Vincent 

Mente versity of Rochester, for their helpful 

arlier version of this paper. 

Willig for reprints should be sent to Spencer 

Ан; ms, Department of Psychology, Univer- 
Zona, Tucson, Arizona 85721. 


elative safety for continued research. 


the increasing. nonmedical usage prompted 
warnings about the possibility of adverse psy- 
chological effects resulting from LSD usage. 

A pamphlet distributed by the State of New 
York warned that LSD can precipitate an 
acute psychotic episode, requiring extensive 
treatment and from which some persons may 
never recover (Narcotic Addiction Control 
Commission, 1968). Grinker (1963, 1964) 
warned against the use of LSD unrelated to 
scientific research and criticized the results 
of LSD therapy suggesting that LSD could 
induce harm, Louria (1967) associated use 
with liberalization of morals and permissive- 
ness and criticized LSD because its users 
withdraw from productive activities in so- 
ciety. He reported that many patients were 
hospitalized with psychoses induced by LSD, 
Largely because of such warnings, possession 
of LSD is now illegal, and research on its 
pharmacological effects and use in psycho- 
therapy has decreased. 

The issue is a controversial one, however, 
and not all authorities agree with the above 
conclusions. Hollister (1967) stated that LSD 
does not induce permanent psychosis and ex- 
plained that the reported psychoses were due 
likely to the attraction of such drugs to people 
who are already emotionally disturbed. Freed- 
man (1968) concurred and said that a causal 
relation between emotional disorder and LSD 


usage was not justified simply by the presence 
of emotional turmoil in concurrence with the 
drug. 

The unfavorable publicity surrounding LSD 
usage has hampered research efforts. Dahl- 
berg, Mechanek, and Feldstein (1968) sur- 
veyed 29 investigators doing research on LSD- 
type drugs. The 19 who responded indicated 
that unfavorable publicity made it difficult 
for them to recruit appropriate subjects and 
caused pressure to discontinue research with 
the drug. Mogar (1968), in announcing his 
decision to withdraw entirely from research 
with’ psychedelics, discussed what he con- 
sidered an oppressive social climate surround- 
ing the drugs, which hindered the investigator. 
He felt that prevailing attitudes made crea- 
tive and objective research difficult, and open 
and rational discussion of the drugs was hin- 
dered because of biases generated by sensa- 
tionalism and controversy. Because of the il- 
legality of the drugs, subsequent nonmedical 
usage has gone “underground,” making it dif- 
ficult to locate people who would admit to 
taking the drugs and producing sampling 
biases in subject selection (Blum, 1964). 

Since much of the public furor over use of 
LSD was generated by claims that its use can 
engender permanent harm, it is important to 
investigate the evidence reporting ill effects to 
determine how closely its usage is related to 
long-term psychological damage. Reviews re- 
port adverse reactions to LSD (Greenblatt & 
Shader, 1970; Schwartz, 1968; Smart & Bate- 
man, 1967), but attempts have not been made 
to differentiate between various conditions 
under which the drug was taken. In addition, 
these reviews failed to relate incidences in 
which follow-ups did not report ill effects. 

The present study reviews the literature on 
long-term adverse psychological effects. Such 
a reaction is defined as pathological behavior 
(including suicide, psychosis, and severe anx- 
iety) lasting for at least 48 hours after in- 
gestion of the drug, the time by which the 
pharmacologic effects have usually subsided 
(Hoffer, 1965). These criteria for adverse re- 
actions are difficult to follow in some cases 
since some reports did not distinguish between 
a dysphoric drug experience and a long-term 
effect. In addition, investigators have defined 
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adverse reactions in different w: Та one sur- 
vey (Ungerleider, Fisher, Goldsmith, Fuller, 
& Forgy, 1968), for example, a physician's 
biases toward LSD led him to state: “1 would 
consider all reactions to LSD as ‘adverse’ re- 
gardless of the immediate subjective response 
[p. 355 |." Reports of negative reactions which 
subsided with drug effects were not included. 
The bulk of the literature in this area con- 
sists of reports of adverse reactions, but in 
addition, studies which reported the use of 
LSD and included a follow-up of the users 
were included. 

LSD has typically been administered under 
one of three basic conditions: (а) to PSY 
chiatric patients in research or psychotherapy 
programs; (5) in unsupervised, nonmedical 
settings, and (c) to normal subjects in Te 
search programs. Since the setting under which 
such drugs are taken influences the outcome 
of the experience, the present discussion 15 
divided accordingly. Most of the literature 
consists of case studies and statistical surveys 
which could easily be criticized on methodo- 
logical grounds. There are a few unusually 
clean and well-planned studies (Ditman, 
Tietz, Prince, Forgy, & Moss, 1967; Mc- 
Glothlin, Cohen, & McGlothlin, 1967, 1968; 
Savage, Fadiman, Mogar, & Allen, 1966; 
Shagass & Bittle, 1967) but these are excep" 
tional. 


"C CHE S 
PSYCHIATRIC PATIENTS IN PsyCHOTHERAP 


AND RESEARCH 


Much of the original research with ы 


was in drug therapy. Procedures and progra! ; 
devised for LSD therapy have appear? 
(Abramson, 1967; Shlein, 1968, Section 3). ; 
Cohen (1960) sent a questionnaire to 62 
vestigators using LSD or mescaline and Т 
ceived 44 responses with reference to us 
5.000 individuals and more than 25,000 pni 
ministrations. He reported four suicide a 
tempts among patients (1.2 per 1,000 cases? 
and concluded that a direct connection dis- 
tween LSD and suicidal behavior could be oie 
cerned in very few instances. The ane ace 
pleted suicide, previously discussed by “prenic 
(1957), was a chronic, regressed, schizo ad 
girl with a 10-year hospitalization ently 
undergone LSD therapy. When inadve 
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allowed out of the hospital on a weekend 
p she committed suicide by throwing her- 
self under a train, 

Hoffer (1965), discussing Cohen's study, 
felt that the rate of suicide was very low con- 
sidering the hopelessness of most of the pa- 
tients and concluded that LSD may have de- 
creased the rate since a high suicide rate could 
be expected in such a group. Smart and Bate- 
man (1967), however, criticized Cohen’s find- 
ings on the grounds that the rate of com- 
plications for the therapists who failed to re- 
spond may have been higher and that there 
Was little follow-up of some of the patients. 
They reported 14 attempted and 6 successful 
Suicides following Cohen’s (1960) study, but 
Concluded that it was difficult to attribute 
Suicidal behavior to LSD since the cases were 
Suicide-prone prior to drug use. 

Malleson (1971) sent questionnaires to all 
Physicians in the United Kingdom who were 
known to have used LSD on human subjects. 
Of the 74 physicians, 73 replied reporting on 
4,300 patients with approximately 49,000 
treatments, and 170 experimental subjects 
With 450 administrations. The reported suicide 
Tale was .7 per 1,000 cases with three com- 
pleted suicides and nine serious attempts. 
“leven additional suicide attempts were re- 
Ported, but no direct relation between the 
Suicidal behavior and LSD ingestion could be 
determined. Response to this survey was 
Nearly 100% and included approximately the 
Same number of subjects but a larger number 
of administrations than Cohen’s (1960) sur- 
Vey, thus speaking to the earlier criticism 
Smart & Bateman, 1967) that the rate of 
op blication was reduced due to the number 

nonresponders. 
кы апа Hartman (1960) reported a 
Euer of suicide in a woman after her first 

ment with LSD. She had a long history 
riage olism and narcotic addiction and had 
* three previous suicide attempts. Prior to 

я un session, she confessed to her thera- 
na nae she was planning to commit suicide 
Wit ae night, and after an argument 
This E husband she made good her threat. 
d ib only adverse reaction in а project 

& LSD therapy with 110 patients. 


bi 
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In Cohen's (1960) survey, eight instances 
(1.8 per 1,000 cases) of lasting psychotic epi- 
sodes following LSD were reported, seven of 
these in patients undergoing therapy. Five had 
long-term courses while two went into remis- 
sion within a few weeks. Malleson's (1971) 
survey reported 35 cases of psychosis (9 per 
1.000), 19 of which recovered completely in 
less than 3 months. There were 10 cases of 
long-term psychosis, some of which were seen 
as potential psychotics who would have suc- 
cumbed eventually regardless of LSD use. 
Details relating to the other eight psychotic 
episodes were unknown. 

Cohen and Ditman (1962) reported five in- 
stances of psychotic breaks. They found un- 
derlying hysterical or paranoid personalities in 
each and concluded that the reactions were 
due to unskillful therapeutic management, 
permitting an upsurge of unconscious material 
which the patient was unable to handle. In a 
subsequent article (Cohen & Ditman, 1963) 
they reported several cases of prolonged psy- 
chotic decompensation, depressive reaction, 
and paranoid reaction in LSD therapy pa- 
tients. In most of these cases psychological 
disturbances antedated the treatment, but the 
yatients had been functioning outside a hos- 
pital. The authors concluded that the inci- 
dence of complications following LSD was 
probably infrequent, although unknown, and 
usually occurred in emotionally labile, hys- 
terical, or paranoid personalities. 

Fink, Simion, Haque, and Itil (1966) ad- 
ministered LSD to 65 psychotic subjects with 
an average hospitalization of 5.1 years. There 
were 158 administrations, and three prolonged 
reactions were noted, consisting of distur- 
bances of mood, affect, and thought. The 
symptoms appeared to be an exaggeration of 
the patient’s pretreatment state. The risk of 
negative reaction to LSD therapy was greater 
in subjects with emotional lability and psy- 
chotic features than in classical forms of 
schizophrenia. 

In a follow-up of patients who underwent a 
single experience with LSD or mescaline, Sav- 
age, Savage, Fadiman, and Harman (1964) 
reported three negative incidents in question- 
naire responses. One patient felt he did not re- 
ceive enough support during the session and 
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suffered from obsessive ruminations for 9 
months following the session, after which he 
improved. Another patient who had a long 
pre-LSD history of depression, hospitalization, 
and suicide attempts made another unsuccess- 
ful suicide attempt 2 months after the LSD 
session. A third patient did not keep his fol- 
low-up appointments and entered another 
facility where a second LSD treatment was 
proposed. He had a psychotic break on the 
evening of the proposed second session, but 
recovered, However, 83% of the respondents 
claimed lasting benefits, with an increase from 
76% at 3 months to 85% after 6 months. The 
authors made clinical ratings on 74 of these 
patients 6 months after the session and found 
that 60 improved, 13 had not changed, while 
1 was changed for the worse, although after 
6 months of therapy he returned to his orig- 
inal base line. These ratings were based on 
clinical interviews and psychological data in- 
cluding the MMPI (pre- and post-) and the 
Leary Interpersonal Check List. 

All of the preceding studies can be criticized 
on methodological grounds. A major flaw in 
each is the absence of a non-LSD control 
group comparable to LSD-taking patients in 
pretreatment pathology. One attempt to intro- 
duce experimental controls was made by Mc- 
Glothlin and Arnold (1971) in a 10-year fol- 
low-up of 247 patients of three physicians who 
had experimented with LSD between 1955 and 
1961. The experimental sample was matched 
with a control group of non-LSD patients 
{тот the same therapists based on psychologi- 
cal tests and attitude measures. No significant 
differences were found between the control 
group and subjects who had taken LSD only 
in therapistinitiated sessions. Of the experi- 
mental subjects, 19 were hospitalized with 
psychosis after their LSD experiences, but 
only one case was reported as being related to 
LSD ingestion. This was a patient who re- 
ceived three LSD treatments, was hospitalized 
1 week after the final drug experience, and 
was unable to work for 3 months. 

The above reports, although methodo- 
logically weak, indicate that the danger of 
psychiatric disturbance resulting from LSD is 
slight when the drug is used with proper sub- 
jects in well-supervised settings. Thus, the 
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evidence does not provide grounds for discon- 
tinuing LSD treatment. Additionally, other 
projects did not report serious, prolonged, ad- 
verse reactions. Amarel and Cheek (1965) 
studied verbal behavior under LSD by ad- 
ministering 100 micrograms and 200 micro- 
grams of LSD 1 week apart to 10 male al- 
coholic patients. They reported a 1-week fol- 
low-up after the second session with no ad- 
verse reactions. Bhattacharya (1966) reported 
on 581 LSD treatment cases and 2,742 ad- 
ministrations. He reported no cases of either 
temporary or permanent psychosis, no suicide 
attempts, and no behavior so bizarre that it 
was beyond control of the therapist. 

Savage et al. (1966) reported on 77 pa- 
tients who had psychedelic therapy over à 10- 
month period. Two thirds of the patients re- 
sembled typical psychiatric outpatient cases, 
while one third were mildly neurotic. The ses- 
sion consisted of a 200-300 microgram dose 
of LSD and lasted an entire day. The thera- 
pist was present to provide companionship 
but not interpretation. Patients were followed 
up to 6 months and were assessed by psycho- 
logical tests, clinical evaluations staff ratings, 
and behavior change. Results were generally 
positive and no adverse reactions were Te- 
ported even though some patients were fol- 
lowed for up to 1 year. 

Shagass and Bittle (1967) studied the ef- 
fects of a large single dose of LSD 6 months 
and 1 year following the session: Patients were 
matched with non-LSD controls on sex, edu- 
cation, marital status, diagnosis, and symD" 
toms. In addition, the experimental patients 
were divided into two groups: responders (л 
= 8), who displayed insight under LSD; anc 
nonresponders ( = 12). Patients were ratec 
on amount of improvement through an inter- 
view, and the experimental group improve 
significantly more than the control group © 
< 05 at 6 months, p < .02 at 1 year). more 
controls than LSD subjects required follow-UP 
therapy. No prolonged adverse reaction 
reported. 

Although it is apparent that LSD сап 
duce an exacerbation of symptoms Qr tri 
chotic behavior in some disturbed psychia ive 
patients, the literature is far from сой t 
on the role of the drug experience in ра! 
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cal behavior. One of the major problems is the 
fact that base-rate statistics for suicidal be- 
havior and Psychosis among patients are not 
available. Sandison (1966) stated that both 
suicide and psychosis are possible complica- 
tions of psychotherapy even when no drugs 
аге used. Levine and Ludwig (1964) also felt 
the results support a conclusion that the drugs 
аге safe and indicated that perhaps the in- 
cidence of negative reactions is just as high 
or higher for comparable patients in other ac- 
tive therapies, The result of several other re- 
ports (e.g., Buckman, 1968: Savage, 1968) in 
Which LSD therapy produced positive results 
lends Support to the contention that LSD 
Сап be а safe therapeutic instrument. when 
Properly administered to suitable subjects. 


Reactions TO UNSUPERVISED DRUG UsAGE 


The majority of reported adverse reactions 
ave been from instances where the drug was 
laken in unsupervised, nonmedical settings. 
Undoubtedly, most LSD ingestions have oc- 
curred under such conditions, but the exact 
number of administrations, purity, and dosage 
evel remains unknown. The literature con- 
Sists mostly of case reports and surveys rather 
than experimental studies, 
lumenfield and Glickman (1967) reported 
9n 25 patients with a history of LSD use seen 
in the psychiatric emergency room of a hos- 
Pital over a 10-month period. Thirteen were 
admitted with symptoms related to LSD and 
Schizophrenic or “borderline schizophrenic” 
Clagnoses, In addition, four suicide and two 
lomiciqe attempts were noted. Seventy-two 
Percent of the patients had psychiatric treat- 
Ment prior to taking LSD, and 40% of these 
ad psychiatric hospitalizations. The authors 
late = the current hospitalizations were re- 
life im Previous psychiatric difficulties and 
drug ations rather than the effects of the 
буг, his conclusion was supported in a fol- 
het Of 15 of these patients (Glickman & 
tients 10014, 1967). Tt was found that the pa- 
| Tesg],, Ran LSD ingestion as an attempt to 
same oh life crisis, and the stresses were the 
Without a. which often lead to hospitalization 
Only thas ae use. The authors concluded not 
have à the psychoses and suicides would 
Ccurred if ‘the patients had not taken 
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LSD, but the fantasy of “cure” through LSD 
may have helped to postpone the actual break. 

A survey of negative side effects of LSD 
was reported by Ungerleider and his colleagues 
(Ungerleider, Fisher, & Fuller, 1966; Unger- 
leider, Fisher, Fuller, & Caldwell, 1968). 
Seventy patients whose diagnoses were related 
to LSD, seen over a 7-month period, were re- 
viewed. Twenty-five of these were admitted to 
the hospital. Twenty-seven percent were di- 
agnosed psychotic, 21% neurotic, and 18% 
character disorder. Ten percent had previous 
psychiatric hospitalizations and 27% had re- 
ceived psychiatric outpatient care. The 25 in- 
patients were compared with a “control” 
group of 25 frequent LSD users, a group of 
rehabilitated criminals, who were not hos- 
pitalized to determine correlates of adverse re- 
actions. The groups were compared on demo- 
graphic background, marital status, employ- 
ment, police record, education, previous psy- 
chiatric care, and the MMPI. Significant dif- 
ferences were found for marital status (096 
inpatients married versus 60% controls, p < 
001), employment (20% inpatients working 
versus 70% controls, р < .01), and police 
records (8^5 inpatients versus 64% controls, 
p < .001). This latter finding was due to the 
fact that the control group was composed of 
rehabilitated criminals. The inpatient group 
displayed severe psychopathy on the MMPI, 
but similar findings were not present in the 
controls. 

Ditman et al. (1967) were also interested in 
why some people had adverse reactions while 
others did not and may even have found LSD 
therapeutic. Subjects were 116 people who 
had taken LSD. Group 1 (n — 52) consisted 
of people who did not need psychiatric care. 
Most were students or working people. Group 
2 (n-— 27) consisted of psychiatric outpa- 
tients who applied for treatment as a result of 
LSD ingestion, and Group 3 (n= 37) was 
composed of psychiatric patients hospitalized 
because of LSD use. Groups were compared 
on the basis of an interview, demographic 
variables, drug history, and the DWM Card 
Sort Test, an instrument developed to assess 
LSD experiences, The employment status of 
subjects was assessed prior to hospitalization 
but after they had taken LSD. The per- 
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centage of subjects who were employed or 
were full-time students was significantly higher 
in Group 1 than in Group 3 (p < .001). Since 
this was not assessed until after the subjects 
had taken LSD, however, a cause-effect rela- 
tion cannot be determined. On the DWM 
Card Sort Groups 2 and 3 reported more un- 
pleasant feelings under LSD than Group ! 
(p < .01) and more intense feelings of despair 
and hopelessness (p < .05). Items reflecting a 
a positive, beneficial experience were reported 
more frequently by Group 1. The authors con- 
cluded that unstable individuals who take 
^ without support and protection are 
likely to find the experience disruptive, while 
the results in the nontreatment group sup- 
ports the notion that LSD has a potential 
value for research in psychotherapy and es- 
thetic appreciation. 
Frosch, Robbins, and Stern (1965) studied 
12 of 27 patients admitted to a hospital as a 
result of LSD ingestion over a 4-month pe- 
riod. Seven were diagnosed as panic reactions, 
three had a reappearance of the drug symp- 
toms without reingestion of the drug, and 
three were psychotic. The panic reactions 
rarely led to hospitalization. Of the three with 
reappearing symptoms, two were diagnosed 
as chronic schizophrenics and the other was 
a borderline psychotic. The three with ex- 
tended psychoses following LSD ingestion 
had clear long-standing schizophrenia: two 
were catatonic and one was paranoid. The 
schizophrenic patients developed a unique 
sense of “insight” under LSD and were sub- 
sequently unable to adapt to the environment. 
A second sample (л = 22) was taken from 
the same hospital and compared with the 
original group (Robbins, Frosch, & Stern, 
1967). This group had more drug experiences 
but the symptoms still fell into the same three 
categories: (а) 11 panic reactions, (5) 8 
symptom reappearances, and (c) 8 extended 
psychoses. The recurring symptoms were seen 
in repeating users when exposed to an anxiety- 
arousing situation and were felt to involve 
psychological as well as physiological factors. 
Contrary to earlier findings, three of the psy- 
chotics were seen as not psychotic before tak- 
ing LSD, although suggestions of severe per- 
sonality disturbance were present. The authors 
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warned that adverse effects could not always 
be predicted from previous history. 

Louria (1967) reported over 130 patients 
hospitalized with LSD-induced acute psy- 
choses, or chronic schizophrenia, and schizoid 
personality exacerbated by LSD. Of these, 
34.2% were reported as having underlying 
personality disturbance, and the author con- 
cluded that an increasing number of suscep- 
tible, unstable individuals were being in- 
fluenced to use LSD. 

Not all studies have indicated preexisting 
psychiatric difficulties in those becoming psy- 
chotic. Tietz (1967) reported a study of 49 
patients admitted to an inpatient service with 
LSD-related complications. Eighty percent 
were from the lower socioeconomic class, and 
dosages from 100 to 2,000 micrograms were 
reported. Patients received psychological test- 
ing and had Bender-Gestalt tests within nor- 
mal limits, but MMPI profiles similar to 
schizophrenics. Fifteen had acute panic reac- 
tions, 5 requiring hospital care, while 6 had 
recurring symptoms and 28 had extended psy- 
choses. Conclusions are limited by the lack of 
information on premorbid adjustment, but 
none had previous psychiatric hospitalizations. 

Many reports on LSD reactions have been 
case studies, which, although valuable for 
gathering data, fail to provide a basis for de- 
termining a relation between LSD and sub- 
sequent reactions since they lack experimental 
controls. Appropriate caution must be used in 
making conclusions from these studies. 

Bowers, Chipman, Schwartz, and Dann 
(1967) reported three cases of adverse reac- 
tions to LSD, two requiring treatment. A 19- 
vear-old college sophomore was admitted vit a 
depression and disorganization after having 
taken LSD. The history revealed a poor chil B 
hood, a previous long-term hospitalization, 
and an inability to relate to others. A 20- 
year-old medical student was seen as an аш- 
patient for five sessions because of anxiety 
following LSD but was not considered pe 
ously disturbed. The authors concluded к. 
each patient had intense needs for і: 
sonal closeness and no access to € 
emotional experiences, Muller (1971) 16 ding 
three cases of prolonged LSD effects y | 
to hospitalization. All were released а 
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weeks, although two returned after a short 
period with a relapse of symptoms. Electro- 
Convulsive treatments produced remission in 
these cases, 
. Two cases of psychosis due to LSD inges- 
lion of sufficient severity to warrant hospital 
admission were reported by Hatrick and Dew- 
hurst (1970). Only one dosage of LSD was in- 
volved in each case, and in one case it was ad- 
Ministered to the patient surreptitiously. Both 
Patients had well-adjusted premorbid person- 
alities, no previous history of regular drug use, 
and both appeared normal for between 2 
Weeks and 2 months before the onset of psy- 
chotic effects. Although the symptoms abated 
Within a number of weeks, the authors con- 
cluded that these cases were of sufficient. se- 
Verity to warrant the removal of LSD from 
Use in Psychotherapy. Tn a rebuttal, Malleson 
1970) argued that the nature of the rela- 
tion between the drug and illness was inde- 
terminable, the pathology was more likely 
Tought on by emotional upset in the patients, 
and the conclusion that such therapy should 
© discontinued was not justified on the basis 
of anecdotal information. 
forts to determine the relation between 
and psychiatric disturbance have also 
psy akened by unreliable and subjective 
Ychiatric diagnoses. Often, reports appear 
attity pases on value judgments, and patient 
Ты Es and life-styles rather than standard 
< IOSlic criteria, Rossi (1971) discussed the 
‘Psychedelic syndrome” as one form of ad- 
bi reaction characterized by preoccupation 
extra Mystical subjects, belief in astrology and 
Societal POLY perception, and a rejection ot 
3 ET values, He related LSD usage in such 
tween Яй attempt to avoid the decision pe 
ass «ea commitment and passivity. 
tela * euh Owers (1970) reported four LSD- 
ases hospitalized by their families be- 


lrawn manner. The patients had mystical 
Се, les, belief in oriental religions, and per- 
late, i €mselves as passive agents manipu- 
With "Y Cosmic forces, АП were well oriented 
thoy ES Боо memory for recent events, al- 
Visu rd of past events was vague, and 
Verg, Wtllucinations and paranoid delusions 
In each case, heavy drug use 


and subsequent reactions appeared when the 
patients were facing heavy stress. The authors 
inferred that heavy drug use may have af- 
fected concurrently developing attitudes and 
values, thus facilitating the acquisition of 

ystical beliefs. A major difficulty with this 
interpretation is the fact that mystical beliefs 
are shared by a large number of young per- 
sons with no history of LSD use. It is equally 
possible that the adverse reactions were due 
simply to an inability to deal with stress situ- 
ations. The diagnosis of unpopular attitudes 
and mystical beliefs as psychiatric symptoms 
is based on cultural value judgments rather 
than objective psychiatric criteria and only 
obfuscates understanding of drug effects. 

Although it is evident that persons may de- 
velop serious psychological disturbances after 
taking LSD, a causal relation between the two 
events is most difficult to determine, In nearly 
every case where a prolonged reaction was 
noted, the history showed signs of psychologi- 
cal disturbances which antedated drug use. In 
addition, evidence indicates that many per- 
sons who take the drugs do so with a belief 
that it will help them solve their emotional 
problems. It is impossible to determine the 
number of unsupervised LSD ingestions, but 
even with the most conservative estimates it 
is obvious that many more persons are using 
the drug than are having negative effects. The 
most reasonable conclusion from the preceding 
section is that LSD can cause psychological 
difficulties in disorganized and disturbed indi- 
viduals who ingest the drug in less than secure 
surroundings, without psychological support, 
and at a time when emotional problems or 
crises are at a peak, 


Use WITH EXPERIMENTAL SUBJECTS 


Although many experiments studying the 
effects of LSD on normal subjects have been 
reported, most investigated the effects of acute 
intoxication on psychomotor or psychological 
ks. Few of these studies followed the sub- 
ject for any period of time after the drug ex- 
perience. As a result, although many reports 
appear, few are relevant to the present review. 
Studies investigating LSD and including a fol- 
low-up are discussed first, followed by studies 
designed to explore long-term effects, 
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Abramson and his colleagues performed 
a long series of experiments assessing the 
effects of LSD on many psychological and 
physiological responses (e.g., Abramson, Jar- 
vik, & Hirsch, 1955; Abramson, Jarvik, Kauf- 
man, Kornetsky, Levine, & Wagner, 1955). 
They used many of the same subjects in over 
150 experiments over a 10-year period. The 
records of six subjects were shown (Abram- 
son & Rolo, 1965) and no adverse reactions 
were reported. The possibility of adverse re- 


r tions was discussed and it was felt that no 


5 anger existed as long as drug usage was 


` supervised by a physician. 

In an experiment comparing the LSD ex- 
perience to schizophrenia, Langs and Barr 
(1968) gave LSD to 30 normal subjects. Eight 
subjects with anxious, manipulative, and hos- 
tile traits on predrug testing showed behavior 
similar to paranoid schizophrenics while under 
LSD, but no long-term adverse effects were 
found. In two studies involving 69 peniten- 
tiary inmates, Katz, Waskow, and Olsson 
(1968) gave 50 micrograms of LSD to 23 sub- 
jects and found either euphoric, dysphoric, or 
ambivalent responses. No adverse reactions 
were reported. Keeler (1965) gave psilocybin 
to 12 normal subjects, personnel at the re- 
search center where he worked, and found sig- 
nificantly higher scores on the schizophrenia 
and hypochondria scales of the MMPI from 
the pretest to a test taken while under the 
drug. Since the subjects had employment con- 
tact with the experimenter after the session, 
a long follow-up is assumed, but no adverse 
reactions were reported. 

Welpton (1968) performed an intensive 
week-long study of LSD users and gave them 
LSD under his supervision. Extensive screen- 
ing, testing, and interviewing were undertaken 
and poor family histories as well as some psy- 
chological maladjustment were found. “The 
most impressive finding was the relative infre- 
quency of adverse reactions to LSD and other 
hallucinogens considering the degree of per- 
sonality disturbances [p. 378].” In Cohen’s 
(1960) original survey, one psychotic reaction 
in an experimental subject was reported. This 
was a man who was an identical twin of a 
hospitalized schizophrenic, and the reaction 
subsided within a few days. Pollard and Uhr 
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(1965) reported on 5 years of experimentation 
using LSD and psilocybin with 80 college stu- 
dent subjects and reported no persistent ill 
effects, due to careful subject screening. 

The low incidence of negative experiences 
among experimental subjects is even more im- 
pressive considering a series of findings re- 
ported by Cohen (1960). He related several 
studies in which the following reactions were 
attributed to the drug experience: migraine 
headaches, influenza, paraplegia, a suicide at- 
tempt, and a skin rash. АП of these reactions 
occurred in control subjects who received 
water placebos. This finding indicates that 
hysterical, suggestible subjects may explain 
subsequent difficulties on the “drug” experi 
ence, indicating that reactions reported due 
to the drug may have been related instead to 
personality factors. 

One of the most extensive and well-con- 
trolled programs to investigate the long-term 
effects of LSD was conducted by McGlothlin 
et al. (1967, 1968), assessing change due to 
LSD in personality, attitudes, values, interests, | 
and performance among normals. Subjects (л 
= 72) were male graduate students who vol- 
unteered for the experiment without knowing 
it involved LSD. Their knowledge of LSD was 
determined and those with LSD experience, 
currently in psychotherapy, or psychoses ш 
the family were screened out. Others were 
dropped because of interview impressions E 
doubtful MMPI profiles. Subjects were given 
a battery of tests assessing anxiety, perso” 
ality, attitudes, esthetic sensitivity, and wt 
tivity. Then, they were divided into Cr 
groups of 24, matched with respect to E v 
edge of the drugs and expectations, e 
ceived the following drugs in a double-b ч 
design: Experimentals, 200 microgfam, s 
LSD: Control 1, 20 milligrams of amp? 
mine: and Control 2, 25 micrograms a 
(below threshold dosage). Subjects s E 
three sessions and received the same p 
dosage each time. On 2-week and wh tbe 
follow-ups, subjects were retested e Е 
same instruments. Thirty-three pereen айх 
experimental subjects had lower ; 
| жеке" of in Control 2 41 
scores compared to 137% In CON tal 5 
in Control 1 after 2 weeks. Experime! i 1 
jects had a significantly lower galva 
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response to stress at the 6-month follow-up. 
Sixty-two percent of the experimental sub- 
jects reported an increased appreciation of 
music, which was supported by behavioral 
Changes such as buying records and attend- 
ing concerts. Other tests showed differences in 
favor of the experimental group but most 
failed to reach statistical significance. No ad- 
Verse reactions were reported, although one 
Subject was terminated due to an “unrealistic 
reaction" to the first drug dose. 

Ditman, Hayman, and Whittlesey (1962) 
Studied 74 subjects with LSD experiences 
from 6 months to 3) vears prior to the fol- 
low-up. Half the subjects were “normals” and 
half were psychiatric patients, and each re- 
Ceived 100 micrograms of LSD. Question- 
naires regarding the experience were com- 
Dleted at the follow-up and 66% reported 
Some degree of benefit. Patients claimed more 
Improvement than normals, although normals 
reported more pleasant LSD experiences. The 
authors concluded that if subjects’ claims are 
Valid, the LSD experience may be therapeutic 
itself. No adverse reactions were reported. 

An investigation of the effects of LSD on 
Problem solving and creative abilities was re- 
Ported by Harman, McKim, Mogar, Fadiman, 
and Stolaroff (1966). Subjects were 27 male 
Professionals whose work required creative 
Problem-solving abilities (engineers, physi- 
cists, mathematicians, architects, artists). 

hey were found to be psychiatrically normal 
and motivated to discover and apply problem 
Solutions within their professional capacity. 

16у were asked to select a problem and en- 
couraged to work on the problem under the 

rug, Subjects were given 200 micrograms of 
Mescaline sulfate (equal to 100 micrograms 
o E in groups of four, accompanied by 
Music Rm They relaxed and шше to 
de or 3 hours, spent 1 hour on psycho- 
Er alins and worked on their problems 
Porteg | hours, The psychometric dua. x 
Under | nien: increases in creativity while 
Subject е drug and at the 2-week follow-up 
Unetign n Ported significant changes in dis 
bilities a Which related to their Enhanced 
Were "ч uring the drug experience. Subjects 
Adverse 2 6 Weeks after the session and no 
factions were reported. 


The above studies indicate quite clearly 
that when psychologically well-adjusted indi- 
viduals took LSD under controlled, secure, 
environmental situations where support was 
available, no long-term adverse reactions oc- 
curred. 


CONCLUSIONS 


The major conclusion of this review is that 
research on LSD, particularly its long-term 
psychological effects, is very poor. Most 
studies are based on subjective case reports, 
surveys, or used very weak experimental con- 
trols. The lack of comparable control groups 
and base-rate statistics for nondrug therapies 
makes interpretation of studies with patients 
very difficult. Any direct relation between drug 
use and subsequent behavior remains strictly 
conjectural. In many case studies, interpreta- 
tions of behavior were often based on the 
biased judgment of the investigator rather 
than any standard criterion. Very few con- 
trolled studies investigating the areas of psy- 
chological functioning affected by LSD, espe- 
cially those areas espoused by proponents of 
consciousness-expanding experiences, have ap- 
peared. The work of McGlothlin et al. (1967, 
1968) represents the type of research with the 
greatest potential for generating scientific 
knowledge of the drug's effects. 

Those adverse reactions which have oc- 
curred were most frequent among individuals 
who were emotionally disturbed, in crisis situa- 
tions or insecure environments, and who took 
the drug in unsupervised settings. Psychotic 
episodes and suicidal behavior, while rare, 
have occurred in such circumstances. The few 
well-controlled studies indicate, however, that 
when the drug was administered to psychologi- 
cally normal subjects under secure circum- 
stances, lasting adverse reactions did not oc- 
cur. Thus, the concerns which led to discon- 
tinuation of research are unsubstantiated, and 
the type of controlled research discussed above 
could be resumed without undue danger to 
subjects. 
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Experiments in the double runway and its 
in terms of several theories that have bee 


of the complexity of the situation, 
lion effect (FE) and a hypotheti 
vincingly demonstrated. Rather, 
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ments, demotivation, and differe: 
clearly supported by the FE, bu 
the role of frustration in producing it, 

In 1952, Amsel and Roussel provided an 
apparently unambiguous demonstration of the 
motivating effects of frustration. They trained 
rats in two runways placed end to end so that 
the goal box of the first served as the start 
box for the second. After training with con- 
tinuous reward in both goal boxes, reward in 
the first goal box was presented on an inter- 
mittent schedule. It was found that running 
speed in the second runway during partial re- 
ward was greater after the omission of reward 
in the first goal box (N trials) than it was 
after reward (R trials). Amsel and Roussel 
called this difference in speed the frustration 
effect (FE) and attributed it to increased 
motivation following omission of an expected 
reward. The present paper is a review of sub- 
Sequent experiments on the FE and the 


theories that have been proposed to account 
for FE. 


THEORIES OF THE FRUSTRATION EFFECT 


Amsel’s Frustration Theory 


The principal th 


cory of the FE has been 
the frustration 


theory elaborated by Amsel 
(1958, 1962). His position is that the frustra- 
tion reaction includes a temporary increase 
in the strength of generalized drive, The 
antecedent condition of frustration is the 
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analogs are reviewed and evaluated 
n applied to this situation, Because 


a simple correspondence between the frustra- 
cal frustration response has not been con- 
a theoretical account of the FE 
possible effects of secondary reward, competing responses, generaliz 
ntial conditioning. 
t neither are 


must include 
ation decre- 
Frustration theory is not 
there clear grounds for denying 


omission of reward in the presence of an 
expectancy of reward, and the expectancy 
is identified with Spence's (1956) nr 
mechanism. The amount of frustration shou 
be directly related to the strength of ra. 3 
The success of Amsel's theory in apie 
the FE has been evaluated in two ways: 
tests of specific assertions in Amsel’s (— 
and (5) experiments with control proc pu. 
or experimental manipulations derived ре 
other interpretations of the FE. ‘The К 
type of test is discussed in this section; br 
periments of the second type are Wr 
along with the theories from which the 
derive. m. 
Magnitude of reward. Two basic ana 
mental designs have been employed in stu 
of the effect of magnitude of reward ‘ferent 
FE. In between-groups experiments, di ШЕТ 
groups of rats have been trained with A 
ent magnitudes of intermittent reward mA ў 
first goal box. In within-subjects exper!” 
different stimuli in the first runway have 
correlated with different magnitudes 0 
mittent reward in the first goal box. Ва 
Using the between-groups procederen 
has usually been found that the di je 


intet 


In addition, two studies (Carlson, o "sien 
& удуй. 1969) found a statistical ilet 
cant FE with a large incentive e i - 
obtain the effect with a small ince! 
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within-subjects experiments, it has been con- 
sistently found that the difference between 
R and N trials increases with magnitude of 
reward (McHose, 1969; McHose & Gavelek, 
1969; McHose, Meyer, & Maxwell, 1969; 
Peckham & Amsel, 1964, 1967). 

Frustration theory, however, makes the 
stronger prediction that absolute speed on N 
trials should increase with increasing magni- 
tude of reward on R trials. Only one between- 
Broups experiment (Krippner et al., 1967) ob- 
tained a difference on N trials. The others 
Obtained differences on R trials only. In 
Within-subjects experiments, Peckham and 

msel (1964) found greater speeds on N 
trials with increasing magnitude of reward; 
Peckham and Amsel (1967) and McHose et 
al. (1969) each found an N trial effect in one 
Of their two experiments but not in the 
Other, The other within-subjects experiments 
found effects on R trials only. 

These inconsistent results on N trials are 
not explicable in terms of Amsel’s theory, but 
\tippner et a]. (1967) have made an addi- 
tional assumption about the double runway 
that may resolve these differences, They have 
Suggested that increasing magnitude of re- 
Ward increases the secondary reward value 
Of the first goal box. This produces a tendency 
Or the subject to return to the first goal from 
* second runway. Speed in the second run- 
ау results from the combination of this 
*hdency to retrace with the effects of frus- 
tration and reward in the second goal. On 
we basis of this three-factor interpretation, 
consistent results seem less surprising. 

Another similar ad hoc account has been 
SUggested by Wilton, Strongman, and Neren- 

erg (1969). They attributed their failure 

9 Obtain reliable FEs with pigeons to com- 
Se responses which they observed and 
De that failures to obtain increases in 
reward Speed with increasing magnitude of 

mn „Тау also be due to the effects of such 
i dung Tesponses, The danger inherent 
tion 5 m is that, without prior pe 
Competir e relative effects of motivation ant 
те 18 responses in a particular experi- 
be ex, almost any effect of frustration might 

“pected, 
ha, uli in the first runway. Two studies 

Varied the similarity of the stimuli in 
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the first runway to the stimuli in the first 
goal box (Amsel, Ernhart, & Galbrecht, 1961; 
Amsel & Hancock, 1957). Both found a 
larger FE when the two were more similar. 
Аз this operation was presumed to be a 
manipulation of the strength of rg, these ex- 
periments are interpreted as supporting Am- 
sel's theory. However, it is unclear why, in 
terms of Spence's (1956) theory, this manip- 
ulation should change the strength of rg in 
the goal box itself. 

Prior training. According to the frustration 
theory (Amsel, 1958), the size of the FE 
should vary as the expectancy of reward is 
manipulated by different types of training. 
Several different experimental designs have 
been employed in testing this prediction: (a) 
Partial reward has been given from the outset 
of training: The FE gradually increases as 
such training progresses (Amsel & Ward, 
1965). (b) Partial reward has been intro- 
duced aíter a relatively small number of con- 
tinuously rewarded trials: The FE has been 
found to be an increasing function of the 
number of prior rewards (Amsel & Ward, 
1965; Hug, 1970a; Stimmel & Adams, 1969; 
Yelen, 1969). (c) Groups of rats have been 
given chronic nonreward in the first goal box 
and then shifted to partial reward: Under 
these conditions, the FE appears in the first 
few trials after the shift in reward percentage 
(Amsel & Prouty, 1959; Ison, Daly, & Glass, 
1967; Ludvigson, 1967; McCain & McVean, 
1967; Platt & Senkowski, 1970) and does 
not differ in size from the FE shown by a 
control group given continuous reward before 
the shift. 

In the first two designs, acquisition of tg 
is confounded with acquisition of the second 
instrumental response, while in the third de- 
sign, the FE appears very quickly. None of 
these results, then, can be said to support 
frustration theory. 

Yet another possible effect of training on 
the strength of rz; is provided by extinction. 
McHose (1963) shifted different groups of 
rats from continuous reward to either partial 
reward or extinction. He found that the FE 
disappeared as extinction progressed but re- 
mained unchanged in the partial reward 
group. In discriminative extinction it has 
been found that the FE, after nonrewarded 


354 


trials to the negative stimulus, decreases as 
the discrimination develops (Amsel & Ward, 
1965; Hug, 1970b). These experiments pro- 
vide strong support for the idea that the FE 
depends on the strength of r;, and the two 
experiments with partial reward control 
groups (Hug, 1970b; McHose, 1963) show 
that the effect is not simply due to some form 
of habituation to the effects of nonreward 
itself. 

Variations in the frustrated response. In 
Amsel's theory, frustration is specifically de- 
fined as the disconfirmation of an expectancy, 
rather than as the thwarting of a response, 
an interpretation suggested by some other 
authors (e.g., Brown & Farber, 1951; Spence, 
1960). On this basis, Amsel should predict 
little effect of the nature of the frustrated re- 
sponse on the FE, although this implication 
has not apparently always been appreciated 
by Amsel himself (Amsel et al., 1961 ). Vari- 
ations in the length of the first runway of a 
double runway marginally affected the ЕЕ 
in one experiment (Amsel et al., 1961) and 
had no effect in another (Clifford & Schindel- 
heim, 1968). Negative results have also been 
obtained with variations in the length of 
fixed-ratio bar-pressing schedules (Cole & 
VanFleet, 1970) and the effort required for 
the frustrated response (Carlson, 1968; 
Grusec & Bower, 1965). Correlating the par- 
tial reward schedule with the vigor of the 
first response also does not seem to affect the 
FE (Logan, 1968). 

Incomplete incentive reduction. Beginning 
with an experiment by Bower (1962), the 
effects of incomplete incentive reductions in 
the double runway have been studied ex- 
tensively, 

Increasing the amount of reward magnitude 
reduction in the first goal box of the double 
Tunway has consistently been shown to in- 

Speed in the second runway 
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small reward in the first goal box, similar to 
the “Wagner” control group for reward- 
omission experiments to be discussed below, 
must be run. With this between-groups de- 
sign, no evidence for an FE produced by in- 
complete reward magnitude reduction e 
been obtained (Barrett et al., 1965; pe 
1968; Karabenick, 1969; McHose & Lu 
vigson, 1965). jit- 
However, Hall and Marr (1969) gave dı 
ferential magnitude of reward training in the 
first runway of a double runway and then 
presented test trials on which the magnitudes 
of reward were reversed. They found that the 
large reward magnitude produced the same 
speed in the second runway, regardless be 
which stimulus had been present in the we 
runway; the small reward resulted in füs | 
running after it was given in the large rewa" 
stimulus than when it was given in the sma 
reward stimulus. id 
Most investigations of delay of rewarc 1s 
the double runway have found slower pg 
in the second runway following delay pis 
ward than following immediate reward (7 i 
Hose, 1966a; Sgro, Glotfelty, & p 
1970; Sgro, Glotfelty, & Podlesni, 19 a 
However, Sgro (1969) found that speed a 
creased when partial delay of reward ke 
introduced and that the amount of increa ; 
was directly related to the amount of poa 
He attributed the difference between his the 
sults and the other work to differences in 
delay procedure used. sed ап 
McHose (1966b, 1968) has бале 
FE both within subjects and in compa! 


F ; tive ^? 
with control groups for absolute incen reward 
simultaneously increasing. delay of foun 


and reducing magnitude of reward. He 1 
that the FE increased with inane. 
In the first study, there was no € in the 
amount of magnitude reduction, Loan ^ 
second, he found a negative relations! P and 
tween the amount of magnitude reductio ns of 
the size of the FE. These demonstrat" ive 
FEs, with a combination of the two ее ШЫ 
variables, dictate caution when pus и 
the failure of either variable alone ot ria! 
ап FE in comparison with the арі 5 
control group. ey stl @ 

e nature of the. de putet 
effect. Tn Amsel's theory, the FE is 4 


rio! 
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to a lemporary increase in drive, although 
the additional possibility of frustration-in- 
duced Tesponses is recognized. There is con- 
Siderable evidence that the FE does, in fact, 
Tepresent the nonspecific effect on vigor im- 
Plied by a Motivational concept. 

Several studies have obtained an FE mea- 
Sured as a difference in activity following R 
Dx N trials (Dunlap, Hughes, O'Brien, Lewis, 

Dachowski, 1971; Gallup & Altomari, 
1969: Gallup & Hare, 1969; Tacker & Way, 
968; Topping, Cole, Matamoros, & Linen- 
berger, 1970). The FE was also demonstrated 
When force of bar pressing was measured im- 
mediately after intermittent reward of chain 
Pulling (Levine & Loesch, 1967), and when 
aggression was measured after intermittent 
Teward of running (Gallup, 1965). These ex- 
Deriments in which the frustrated response is 
different from the response used to measure 
frustration support the idea that the FE is 
а motivational effect, 

One experiment (Schmeck & Bruning, 
1968) has been directly addressed to the re- 
ationship of the FE to Spence's (1956) drive 
concept, In an apparatus consisting of a run- 
Way followed by either another runway or 

Y à complex maze, these investigators found 

at frustrative nonreward in the first run- 
Way increased speed in the second runway 
Ut also increased the number of errors in 

Пе maze, 

Evaluation of Amsel's theory, The success 
OF Amsel’s theory in dealing with these find- 

8S has clearly been rather limited. However, 
the ambiguity of the relationship of many of 
indi results to frustration theory does not 

rate that the theory is unfalsifiable. 
i Aber, it indicates that the double runway 


5 " H 
3 much more complex situation than an- 


ticipated, 
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"Ned Reactions to Frustration 


(grown and Farber (1951) and Marx 
a ave suggested that, early in life, 
ted may learn to respond vigorously to 
hay; ation because vigorous, aggressive be- 


anj 


avi ‹ 
fruge „aS been frequently reinforced in 
This ating situations in the subject's past. 


d х М S 
терот ‘erential reinforcement of vigorous 
Чор mg when frustrated transfers to the 

runway situation and produces the 
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FE. Such a theory leads to all the same pre- 
dictions about the FE as does the motiva- 
tional account, excepting experiments involv- 
ing prior reinforcement of vigor in another 
frustrating situation. The only experiment 
directly addressed to this proposal (Amsel 
& Penick, 1962) was inconclusive. AII that 
can be said about the learned vigor interpreta- 
tion, then, is that there is no evidence on the 
one point in which it differs from Amsel's 
theory. 


Escape from Frustration 


It has been suggested that the FE repre- 
sents escape from frustration rather than in- 
creased drive after frustration (Leitenberg, 
1965; Wagner, 1966). This proposal is the 
obverse of the secondary reward hypothesis 
proposed by Krippner et al. (1967) discussed 
above, and has about the same high face 
validity as an interpretation of the double- 
runway situation, where the subject does, in 
fact, escape from the frustrating situation, 
There have been, however, a number of 
demonstrations of the FE in operant situa- 
tions where performing the second response 
did not remove the subject from the frustrat- 
ing situation (Berger, 1969; Carlson, 1968; 
Cole & VanFleet, 1970; Davenport, Flaherty, 
& Dyrud, 1966; Platt & Senkowski, 1970). 


Unreduced rq 


Bower (1962) has proposed that 7g is 
aroused by the stimuli in the first runway 
and goal box of the double runway and, sub- 
sequently, is reduced by reward in the first 
goal box. On nonrewarded trials, this rg for 
the first reward is not reduced, but persists 
into the second runway, summates with the 
rq for the second reward, and thereby invigo- 
rates responding. This theory makes the same 
predictions as frustration theory without as- 
signing unique properties to frustrative non- 
reward. 

Berger (1969) has deduced a similar ac- 
count of the FE from Perkins’ (1968) theory 
of preparatory responses. The preparatory 
response to the first reward persists on N 
trials, thereby increasing the value of the 
second reward. This leads to the prediction 
that the FE should be greater when both re- 
sponses lead to the same type of reward than 


356 


when they lead to different rewards. Berger 
himself (1969), however, found the same size 
FE regardless of whether the second response 
was motivated by food reward or shock avoid- 
ance; Levy and Seward (1969) found the 
same size FE with food in the first goal box 
and either food or water in the second; Amsel 
and Penick (1962) obtained an FE with 
food in the first goal box and a "social" ani- 
mal in the second; and a number oí studies 
have found an FE with presumably unre- 
warded activity as the indicant response 
(Dunlap et al, 1971; Gallup & Altomari, 
1969; Gallup & Hare, 1969; Tacker & Way, 
1968; Topping et al., 1970). There is, then, 
no support for the Berger-Perkins interpreta- 
tion of the FE in the one area where it differs 
from the interpretations proposed by Amsel 
and Bower, 

A similar account of the FE can be derived 
from the concepts of inertial tendency (At- 
kinson, 1964) or persisting tendency (Birch, 
1968). These writers suggest that an ra, once 
aroused by a stimulus, will persist for some 
time in the absence of stimulus support until 
a goal, like the one on which it is based, is 
reached. 

The only difference between this view and 
the theories already discussed is that persist- 
ing tendencies have been shown to last for 
as long as 30 minutes (Birch, 1968), while 
the FE is marginally attenuated after 30 
seconds’ confinement in the first goal box 
(Amsel & Roussel, 1952), substantially at- 
tenuated after 90 seconds’ confinement (Mac- 
Kinnon & Amsel, 1964), and completely 
eliminated after 5 minutes’ confinement (Rob- 
inson & Clayton, 1963). 


The Demotivation Hypothesis 


Seward et al. (1957) performed an FE 
experiment in which, in addition to partial 
reward in the first goal box, intermittent re- 
ward was given in the box of the first 
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As Amsel defined írustration in terms of 
disconfirmation of an expectancy rather than 
frustration of a response, there is no reason 
to assume, as Seward et al. (1957) apparently 
did, that intermittent prefeeding should not 
produce frustration. The effects of intermit- 
tent prefeeding obtained by Seward et al. 
and subsequent investigators (Kemble & 
Clayton, 1967; McCain & Power, 1966) 
appear to be another case of the absence of 
an effect on the FE of the task required in 
the first runway discussed earlier. Р 

Maltzman (1952) reviewed the early liter- 
ature on prefeeding and concluded that small 
amounts of prefeeding facilitate subsequent 
performances while larger amounts inhibit Ms 
Daly (1968) found a similar effect with dif- 
ferent magnitudes of reward in the first goal 
box of a double runway, and Dachowski 
(1967) found that different magnitudes of 
continuous reward in the first goal box of à 
double runway had no effect on running speed 
in the second runway. These inconsistent 
demotivation effects stand in sharp contrast 
to the surprising reliability of the FE. It 
seems unlikely that an effect which is elusive 
when studied directly can account entirely 
for a different phenomenon which is marke 
by the frequency and consistency with which 
it has been demonstrated. ү, 

The inadequacy of the demotivation m 
pothesis alone was most convincingly des 
strated in an experiment by Wagner (195: a 
In addition to groups of rats showing the a 
he ran a control group that was never = 
warded in the first goal gox. The FE е 
assessed by comparing performance of er” 
partial reward groups on N trials qub E 
formance of the control group, for W S 
all trials were N trials but presumably ^ J 
frustrating. He found a reliable betw k 
subjects FE measured in this way whic 
not be explained in terms of demotv uced 
Nonreward after a history of reward [о е 
faster running than nonreward Leap 1 
prior rewards. Since Wagner's (1959) 9. раз 
experiment, this *Wagner control" s 
been used frequently. A few dude ub” 
failed to find the FE measured betwe бї, 
jects in this way (Kemble & Clayton ja F 
Ludvigson, 1967; McHose, 1963; Él ob 
Braud, 1969), and Daly (1968) 9^7 
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tained a reliable between-subjects effect in 
One of three pairs of groups, but most studies 
have obtained the between-subjects FE 
(Barret et al., 1965; Dunlap & Frates, 1970; 
Dunlap et al., 1971; Hamm, 1967; Ison et al., 
1967; McCain & McVean, 1967; Robinson 
& Clayton, 1963; Schmeck & Bruning, 1968; 
Strongman & Wookey, 1969). 


Generalization Decrement Hypothesis 


, In addition to possessing possible motiva- 
tional properties, the events in the first goal 
box of a double runway may serve as part 
of the stimulus complex controlling the second 
Tesponse. The first few trials after any change 
In conditions, then, should be interpreted with 
Caution (Amsel & Prouty, 1959). Further, 
9n fixed-interval schedules in operant pro- 
Cedures, reward is an important cue controll- 
ing the fixed-interval “scallop” (e.g., Staddon, 
1967). Omission of reward on these schedules, 
then, removes the principal cue for slow 
responding, producing an increase in response 
Tate that may be entirely due to a generaliza- 
Поп decrement. 

Staddon (1970) has suggested that this 
analysis may not be specific to fixed-interval 
Schedules but may be appropriate to FE 
€xperiments in general. Most of the responses 
“sed to measure the FE show a positive goal 
Sradient, although the effect is not usually 
AS marked as it is with fixed-interval sched- 
ules, After continuous reward, therefore, the 
aftereffects of reward should have a condi- 
tioneq inhibitory effect on behavior through 
Meir association with slow responding early 
In the chain, Omitting reward produces a 
Seneralization decrement, which in this case 
results in an increase in response speed in the 
arly Part of the runway or response chain. 
far, bilans that reward can acquire 
Cale. Ко subsequent responding (e.g. 

» Meyer, Maxwell, McHewitt, & Mc- 
ow 1969; Staddon, 1970), however, do 
& Imply that this must always be the case, 
there are several FE experiments that 


nm to present difficulties for Staddon's posi- 
the b 15 not clear in this framework why 
Tewar should persist with continued partial 
1963) ming (Hug, 19700; McHose, 
tion ( ut disappear with continued extinc- 


“Hose, 1963) or discrimination train- 
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ing (Amsel & Ward, 1965; Hug, 1970b). The 
findings that shifts from chronic nonreward 
{о intermittent reward produce an almost 
immediate FE (Amsel & Prouty, 1959; Ison, 
Daly, & Glass, 1967; Ludvigson, 1967; 
McCain & McVean, 1967; Platt & Senkowski, 
1970) appear to be as difficult for Staddon's 
theory as they are for Amsel's. In terms of 
Staddon's theory, the sudden introduction of 
reward should have the opposite effect. 

In order to explain any of these results, 
additional assumptions must be added to 
Staddon's theory. In view of the lack of ex- 
plicit assumptions in Staddon's (1970) paper, 
it is impossible to properly evaluate this in- 
terpretation of the FE. 


Differential Conditioning Interpretations 


The double runway and its operant analogs 
can be viewed as a special case of differential 
conditioning in which the two stimuli are 
presented in closely successive pairs in a 
fixed order (Daly, 1968; McHose, 1970). 
That is, responses in each runway are re- 
warded on particular schedules, which may 
be different; and the two runways are dis- 
criminable on the basis of either order of 
presentation or physical differences. Thus, in 
addition to any effects attributed to after- 
effects of the event in the first goal box, the 
contrast and generalization effects generally 
observed in differential conditioning (Black, 
1968; Dunham, 1968; McHose, 1970) should 
also be observed in the double runway. Daly 
(1968) first elaborated this view in account- 
ing for effects of magnitude of continuous 
reward in the double runway and used it, in 
combination with frustration theory, to ac- 
count for her incentive-reduction results. 

McHose (1970) has presented a more 
systematic account, arguing that the results 
of double runway experiments may be ade- 
quately attributed to (a) effects of differential 
conditioning and (5) demotivational proper- 
ties of reward. He has further proposed a 
description of differential conditioning that 
predicts negative contrast or generalization 
effects in both positive and negative stimuli, 
The size of the effect is proportional to the 
difference between the incentives signalled by 


the two stimuli. 
The within-subjects FE, according to Mc- 
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Hose, is entirely due to demotivational effects 
of reward, but the between-subjects FE ob- 
tained with the Wagner control group is due 
to both factors. Chronic nonreward in the 
first goal produces slower running in the 
second runway for the Wagner control group 
than does the continuous or partial reward 
given to the experimental group (contrast 
or generalization effects), Combination oí 
these two partially independent effects, then, 
can potentially explain not only the between- 
subjects FE but also the reported failures 
to obtain this result. 

There are, however, some weaknesses in 
his analysis, First, as has been pointed out, 
the demotivating effects of reward are not 
found as consistently as the FE, The second 
weakness lies in the failure to explain the 
immediate occurrence of the FE when chronic 
nonreward is changed to reward (McCain 
& McVean, 1967), In this experiment, the 
difference between N trials and R trials not 
only appeared almost immediately, but repre- 
sented an increase in speed on N trials rather 
than a decrease on R trials. As differential 
conditioning is presumably an associative 
process like the conditioning of rg in Amsel's 
(1958) theory or the development of inhibi- 
tory control in Staddon's (1970), this experi- 
ment presents the same problem for all three 
analyses. 

It also seems that in experiments showing 
the effect of magnitude of reward on the FE, 
the analysis Proposed by McHose has as in- 
Consistent a record as frustration theory. 
McHose's analysis seems to predict that, if 
reward in the first goal box is larger than 
reward in the second, then N-trial Speed 
Should be inversely proportional to magnitude 
of first goal box reward. This result, the 
Opposite of the one predicted by Amsel, has 
not been obtained (Barrett et al, 1965; 

Пррпег et al., 1967; McHose & Ludvig- 
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al, 1969; Fox, Calef, Gavelek, & McHose, 
1970; McHewitt, Calef, Maxwell, Meyer, & 
McHose, 1969). However, it is not clear that 
this interpretation is superior to Daly's 
(1968) combination of differential condition- 
ing and frustration effects rather than differ- 
ential conditioning and demotivation effects. 


CONCLUSIONS 


While early experiments on the FE were 
taken as strong evidence for the existence of 
frustration drive (e.g., Amsel, 1958, 1962; 
Mowrer, 1960; Spence, 1960; Yates, 1962), 
subsequent research and theorizing has shown 
that the double runway is such a complex 
situation that this basic conclusion may have 
been unwarranted, The numerous accounts of 
the double runway may be classified accord- 
ing to their success in accounting for the data, 
although all have been important in suggest- 
ing control procedures and new directions for 
investigation. 

* Theories of persisting tendencies (At 
kinson, 1964; Berger, 1969; Birch, 1968), 
escape from frustration (Leitenberg, 1965; 
Wagner, 1966), and generalization decre- 
ments (Staddon, 1970) cannot alone account 
for a number of results. ; 

* Several theories are isomorphic to Amsel's 
(1958) frustration theory in the double rule 
way (Bower, 1962; Brown & Farber, et 
Marx, 1956), and will share the fate of that 
analysis. Amsel's theory itself, like the de 
motivation hypothesis (Seward et al., v 
cannot account for all the results without t? 
addition of other assumptions. sis 

e Variations of the demotivation зуд 
(McHose, 1970) or frustration theory ipn | 
1968; Krippner et al, 1967) seem bas 
consonant with the existing literature. a 
ever, the complexity of these proposals ae 
a choice among them impossible at the рте 
time. 
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A question of the highest developmental ar 
experimental analysis of imprinting: How d 


their own species in the course of ontogeny? 


nd perceptual significance launched the 
lo animals come to identify members of 
As laboratory rescarch on imprinting 


in birds has become more and more removed from that primary question, it 
threatens to continue to neglect certain fundamental natural history and neuro- 


embryological facts which are basic to the 
specific perception in birds and perhaps other animals. These problems are discus: 


in the context of an empirical review of the 


components of species identification in wild 


In the present review the initial response 
of young ducklings to certain patterns of 
auditory and visual stimulation is examined 
in order to determine their preferences and 
especially to find out how selective the “naive” 
hatchling’s perception of species-specific stimu- 
lation is prior to entry into the imprinting 
situation. The question is whether young 
ducklings are uniquely sensitive to patterns of 
auditory and visual stimulation provided by 
adult members of their own species in advance 
of actual contact or exposure to adults. ‘To 
the extent that “parentally naive” ducklings 
are selectively responsive to certain aspects of 
species-typical parental stimulation, an ex- 
tremely interesting problem is posed for prior 
ontogenetic analysis, namely, how in the course 
of previous individual development does this 
selective process of perception come about? 
In most studies of imprinting, the “naive” or 
preexperimental sensory and perceptual prefer- 
ences of the to-be-imprinted animals are taken 
for granted or ignored. The purposes of the 
Present review are (a) to place these particular 
Sensory and perceptual phenomena in the 
context of the development of species identifi- 
КЕН and! (0) to discuss their more 
Cas tnt stell to Jo Ann Winfree and Mary 
Was provided by the ‘Meck Con eel эд 
Mental Health and by United S rod Eden р 
Service Research Grant HD 008 ap E са 
Institute of Child аА 1 | HP fun the National 

a “nd Human Development. 
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study of the development of spec 
ed 
development of the auditory and visual 
and domestic ducklings. 


eT nt 
significance for the study of the developmen 
of sensory processes and perception. 


OBSERVATIONS IN NATURE 


In nature, ducks and certain other highly 
precocial bird species take leave of the nest 
soon (within from 1 to 2 days) after the young 
have hatched. In initiating the departure from 
a ground nest, the duck hen vocalizes T€ 
currently and slowly walks away from het 
brood, thus presenting them with a certain 
pattern of auditory and visual stimulation 
which differs from species to species. Eo 
brood quickly follows the hen and short? 
thereafter. they begin to feed for the en 
time. Prior to the departure from the nest 


жа agent 
* The megapodes (mallee-fowl, brush-turkeys) prion 
à striking exception. to the generalized Ё de: i not 
presented here. The eggs of these species ie heat 
incubated by the parent but are incubated by t nich 


arising from volcanic ash or the compost heap d тот 
they are buried. Upon hatching, the young Де ai 5 
the nest-site. If they happen to encounter their Pn һе 
in the vicinity, they pay them no mind. AGOE heir 
parents show great concern (behaviorally) ah 
eggs, they appear to have no interest in their 0 ‚ young 
On the second or third day after hatching, s ecs 
are sufficiently precocious to fly up into р h 962) 
where they roost, apparently in solitude (1 i: raul 
The development of species identification in : strali^ 
of birds, most of which inhabit the region 0f 4 igi 
presents an intriguing analvtic prablen 
problem is presented by the parasitic SURE ie nest ig 
cowbirds, et al.) which lay their eggs mk inter?" 
other species of birds. It would be s die ion pl? 
to test the possibility that self-stimulat 5 
important role in the ontogeny 
perception in the megapodes and 1 
laying species. 


he para? 
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the hen offers tactile, thermal, auditory, and 
Occasionally, visual stimulation to her brood 
(Bjürvall, 1968; Collias & Collias, 1956; 
Gottlieb, 1963, 1965). During the nest de- 
Parture she presents them only with auditory 
and visual stimulation. It is of obvious 
adaptive advantage for the young to promptly 
арргоасһ, follow, and stay near the hen when 
she quits the nest. It is in the postnesting 
Period that the young duckling's thermo- 
regulatory ability matures and their acquaint- 
ance with their postnatal environment develops. 


AUDIOVISUAL STIMULATION 


Eric. Fabricius (1951), expanding on the 
van casual observations of Lorenz (1935, 
37), was among the first to observe that the 
Combined presence of rather general auditory 
and visual stimulation substantially increased 
the liklihood of eliciting a following response 
tom young Tufted, Eider, Mallard, and 
loveler ducklings, and his finding has since 
2601. confirmed a number of times and in 
Several different species (Redhead ducklings: 
ollias & Collias, 1956; Mallard ducklings: 
Loyd & Fabricius, 1965; Peking ducklings 
lii". domesticated form of Mallard: Gott- 
Dati 963; Mallard E Wood ducklings: 
E €b, 1968; Bobwhite quail chicks: Porter 
x Stettner, 1968; domestic chicks: Smith & 
vin 1963). These findings articulate well 
follow S situation in nature, SA peer 
Many duck species is initially elicited by the 
Combined 4 xi : i idit ше and visual 
Stim ^» presence of auditory and visua 
as im emanating from the maternal үе? 
hike. leaves. the nest. These findings also 
the id good neurological sense and accord with 
Man i of Intersensory facilitation or summa- 
aS potentiating behavioral responsiveness. 

у inten Important question left unanswered 
the Ys ү facilitation, however, CS 
و‎ specificity of the audioviš = 
Visual n aggregate: Are the auditory г 
in any wo pal properties of the parent birc 
Tes," УЧУ unique in initiating the following 
55 102 ‘the young of her species? To 
= T this question, one must test incubator- 


I —— 
(4; 0 hole-nestir 


sti,’ SPons "E species such as the Wood duck 

nu i ш the hen initially presents only auditory 

Ottie lo her Young during the nest departure 
» 1963, 1965). . 
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TABLE 1 
Burst, RATE, AND DOMINANT FREQUENCY CHARACTER- 
ISTICS OF MATERNAL CALLS USED IN BEHAVORIAL 
‘TS WITH DUCKLINGS 


Note 
Call ——=+= Dominant 
а е frequency 
ele | Average : 
Single | Average (in Hz.) 
| second | 
Mallard hen | 9 | 41 |780,1,25, 1,600, 
2,300 
Chicken hen | 7 | $$ 775 
Wood duck hen 13 | T4 | 1,300 
Pintail hen | 9 mo | 440 
Mandarin hen | 9 9.0 1,325 


repeated 
duration of the tap 
dividing the number of not 
ning of the first note to the end of the la 
Since the notes do not occur at eq intei 
the rate presented for each call is an avera: 


ch call was determined by 
by the total time from the begin- 
note in {һе burst, 

within the burst, 
for the entire burst, 


hatched birds of various species with the 
maternal audiovisual configuration of their own 
species and other species. Consequently, the 
author recorded the vocalizations (Table 1) of 
various duck species in nature in the act of 
leading their voung from the nest (Gottlieb, 
1963, 1965, 1971a) and had ducks of the various 
species stuffed and fitted with an internal 
speaker so the calls could be played through 
them.* The next step was to present these 
moving and calling visual replicas individually 
to young ducklings which had been incubated 
and hatched in the laboratorv. The author 


study of the parameters of audiovisual 
perception it is not a good practice to have the calls 
emanate from speakers inside the visual replicas because 
the internal characteristics of the replica (and the way 
the speaker is mounted) can cause acoustic distortions 
which can influence the bird's response. Most recently, 
therefore, we have used a very high quality midrange 
dome radiator speaker (Acoustic Research, Cambridge, 
Massachusetts), which has a virtually flat frequency 
response from 200-16,000 hertz mounted outside of the 
visual replica. This is a particularly important methodo- 
logical point in simultaneous audiovisual choice tests 
where the same call is produced from (behind) both 
replicas rather than from internal speakers. In the latter 
e, we have observed a response bias which was most 
likely attributable to an alteration of frequency (pitch) 
or timbre produced by one of the speakers in one of the 
replicas. In any event, that particular. (unpublished) 
result could not be reproduced when the undistorted 
call emanated from the wall-mounted speakers, whereas 
our other results were replicable under the improved 


û l'or a pre 


acoustic conditions. 
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Vic. 1. Visual replicas used in the present experiments: Peking hen (at rear), Mallard 
hen (left), Mallard drake (right), and Wood duck hen (center front). 


began by assessing the ducklings’ following 
response to the maternal audiovisual configura- 
tion of their own species in a 20-minute test 
during the first 2 days after hatching. In these 
experiments and the other ones alluded to in 
this review, all the birds were hatched in the 
laboratory and maintained in individual visual 
isolation in brooders in light- and sound- 
treated rooms designed for these specific 
purposes. Unless noted to the contrary, no 
attempt was made to manipulate the birds’ 
experiences prior to testing, so these results 
pertain to birds which had heard but not seen 
their siblings (conspecifics) and had neither 
heard nor seen maternal stimuli 
testing. Since the 
auditory 
plays 


prior to 
amount of prior exposure to 
stimulation from self and siblings 
a role in the response to the maternal 
call ‚а! testing (Gottlieb, 1966, 1971a), an 
explicit effort was made to maintain the birds 
In groups of no less than 20 each in the incu- 
bator and the brooder prior to the test, I 
vird was tested individually and 
rigorous criteria were applied in 
their approach or folle 
in detail in ( 


Zach 
rather 
measuring 
awing response (described 


зое, 19713. Ch. 2). The visual 


replicas are depicted in Figure 1. To appreciate 
the design of the experiments inyana 
Mallard ducklings and their domestic дып 
tives (Peking ducklings), the reader ee 
know that these animals are of the same Ep воҳ 
(Anas platyrhynchos) and that artificial = of 
tion has changed the visual characterise p- 
the hen (color, size) but not her vocal Mn 
toire. "Thus, in the first experiment below, ^. 
Mallard ducklings are tested with the + | 
maternal call emanating from the WE 
hen and the Peking ducklings are n hen 
the same call issuing from the Peking ix 
see Figure 1). The Wood ducklings duck 
sponsa) are tested with the Ww де d duck 
maternal call emanating from the Woo! 
hen. 

As can be seen in ligure 2, 

specific maternal audiovisual con! | 
elicited following in 84% of the М 2% 
ings, 95% of the Mallard duckling®s © ris 
of the Peking ducklings. When be est? 
is deleted and the birds 2 


allarc 


the spec 
nfigura 


'omponen =. arate. ae 
compo: t Күрдү epatios 21 qe 
to the maternal call of their s] Ws 
rom a nonvisible, moving sounc e$] 


2707, 
: chr noL and 5255 
figures drop to 59%, 50%, ane 


SPECIES IDENTIFICATION IN BIRDS 365 


100 
Mallard Ducklings (N 60) 
ن‎ % oPeking Ducklings (9:93) 
ж x Wood Ducklings (N.72) 
80 
о \ 
S 70 
2 
č 60 
25 
A 0 
2 
5 40 
2 
5 30 
8 
Š 20 
ij 
goo 
0 
Maternal Maternal Maternol 
Audiovisual Coll Visual 
Combination Replica 


The percentage of Mallard, Peking, and Wood 
КҮ н 5 which followed their re ive species 

‘ternal call and a visual replica of a hen of their 
Species, separately and in combination. (Data taken 
from Gottlieb, 19712.) 


lively. When the auditory component is 
deleted and the birds are tested to the silent, 
Moving, visual replica of a hen of their own 
Species, the figures drop even further to 11%, 
30%, and 7%, respectively. Thus, the species- 
Specific maternal audiovisual configuration 
tlicits a higher degree of following than the 
Purely. auditory component, and the auditory 
Component is more effective than the purely 
visual component. 

To examine the question of the possible 
Uniqueness of the species’ maternal audiovisual 
guration, the Wood duck maternal call 

as substituted for the Mallard call, and 
Mallard апа Peking ducklings were tested with 
с Wood duck call emanating from their 

Spective maternal visual replicas (Mallard 
ten and Peking hen) In this case, their 
we response dropped to 60% (Mallards) 

Ч 70% (Pekings) from the 90% level when 
2 Mallard call issued from these replicas 
of Кк ы 3). In order to assess the effect 
üs wa ee the maternal visual rene 
Mallard а the maternal auditory pe , 

eir Bd Peking ducklings were testec or 
а owing response when the М ood duck 
yes from a Mallard drake ppl 
a Е кр hen. In this instance, 6990 of Ше 
Take 4. апа 51% of the Pekings followed t ne 
the p and 50% of the Mallards and 44% of 


(pj, C ings. followed the Wood duck hen 
ure 3) е 0 


EDEN E 
Sbecic. ^ then, the results indicate that the 


S-Specit n 288 Е 
Pecific maternal audiovisual configura- 


tion is the single most effective object for the 
elicitation of the following response. When a 
substitution was made in the auditory com- 
ponent, the response declined significantly and 
when substitutions were made in both the 
auditory and visual components, the Pekings’ 
response declined somewhat further and the 
decline in the Mallards’ response remained 
about the same. 

In the final experiment on this topic, only 
the visual component was changed: The 
Mallard call was made to emanate from the 
Mallard drake and the Wood duck hen. Under 
these conditions, 83% of the Mallards and 90% 
of the Pekings followed the Mallard drake 
and 100% of the Mallards and 100% of the 
Pekings followed the Wood duck hen. Thus, 
when the visual component is changed and 
the species-specific auditory component is 
retained, the following response of Peking and 
Mallard ducklings climbs back to the previous 
high level obtained when the species’ maternal 
emanates from the maternal replica. 
(The outcome of these last two experiments is 
summarized in Figure 3.) 

These results indicate two things. First, the 
species-specific maternal audiovisual configur- 
ation is not uniquely effective in eliciting the 
following response and, second, the attractive- 
ness of the species’? maternal call seems to 
override (or compensate for) changes in the 
visual component. In essence, there is little 
or no decline in the following response of 
Mallard and Peking ducklings when the 


100 pe 
= * о a 
£ 90 
е . 
$ 80 Mollord Call 
Pn 5 D 
ҺЕ . 
3 50 о г 
© ° 
2 40 Wood Duck Call 
$30 
$ 20 
3 e Mollard Ducklings (N +114) 
sm о Peking Ducklings (195) 
Иши Peking Mollard Wood Duck 
Hen Hen Droke Hen 


Fic. 3. The percentage of Mallard and Peking 
ducklings which followed various visual replicas when 
is emitted either the Mallard maternal call 


the repli 
or the Wood duck maternal call. (In all cases the 


‚ Mallard call instigated a higher degree 
an did the Wood duck call. Data taken 


presence of the 


of following th d 
from Gottlieb, 197 1a.) 


| 
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Fic. 4. Maternal auditory versus maternal visual choice test. 
hen is placed in opposition to her call emanating from a moving, nonvisible sound source 


GILBERT GOTTLIEB 


(In this test the replica of a silent, moving 
»eaker 


mounted inside a plywood box behind an opaque curtain. A Mallard duc kling is shown following the 


source of the call in the above photograph.) 


Mallard call accompanies any of the visual 
replicas, but there is a significant decline in 
responsiveness when it is replaced by another 
call, even when that call issues from the 


TABLE 2 


INCIDENCE or APPROACH RESPONSE Or PEKING AND 


MALLARD DUCKLINGS TO MATERNAL CALLS 
OF VARIOUS SPECT 
Maternal calls 
Ducklings pes Tm Em NE anm 
Mak | Binaj | Wood Man- |. 
| јака | Poss | dück | eh |‹ hicken 
Pekings | 76% | во | 60% | 28% | s 
(а = 125) | | | | 
Mallards 81% 1067 А 
alla ‘ 80% | 40r 5% | 7567 
(NE t о | 40% | 25% 75%, 


ted only once with each call; data taken 


Mallard hen or the Peking hen. These peso 
Suggest that, in the initial stage of postna ti 
development in ducklings, species identifica it 
is based primarily on the auditory compon ri 
of the maternal audiovisual-stimulating pa 
plex. They also suggest that the Paue 
specific auditory component may be Md 5 
in its effectiveness in eliciting the respons da 
young ducklings, These two different pP 
lions are examined further in the mex 
sections. 


А E ATION 
AUDITORY VERSUS VISUAL STIMULA that 


To further document the su the 
species identification is based priman а 
auditory aspect of the maternal ай moving) 
configuration, the two component atte 
were presented separately but qe did dins 
to Wood ducklings and to Mallar 


— — 
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TABLE 3 


ғ PEKING AND MALLARD DUCKLI 


IN SIMULTANEOUS AUDITORY APPROACH TESTS 


Preference (и) 


Choice test calls п Responded U) | -— — 
| Other | Both 
Pekings | | | | 
Mallard versus Pintail 21 | 19 19 0 | 0 
Mallard versus Wood | | | 
duck m | 2; | 24 | 3 0 
Mallard versus chicken 30 | M | H 0 0 
Mallard | 
Mallard versus Pintail 20 19 | 18 | 1 0 
Mallard versus Wood | | | | 
duck | 22 19 | 18 | 1 0 


а. => е ! Б 


Vole, Data taken from Gottlieb, 1971a. 
M a choice test situation depicted in Figure 4. 
Although no behavioral evidence of conflict 
Was observed, only about half the birds re- 
Sponded, and all of them followed the call. 
(The response figure—53% for the Wood 
ducklings and 45% for the Mallards-—is close 
to the level of response addressed to the 
Maternal сай presented by itself under the test 
Conditions [Figure 2].) These results support 

© notion that species identification is 
опаде primarily on the auditory stimulation 
Provided by the parent bird. 


AUDITORY STIMULATION 


«е nest question concerns the uniqueness 
of s ы becies maternal call in the development 
p inr identification : Are incubator-hatched 
5 uniquely sensitive to the maternal call 
› their own species? To answer this question, 
eking and Mallard ducklings were tested for 
pe response to a variety of maternal calls 
fete шл approach test, each bird being 
in Tal "i once to one of the calls. As shown 
Several € 2, the birds are equally responsive to 
hus P3 of the calls in the series, and there Б 
о evidence of the unique attractiveness 
217 own species call under these test 
inb. When the birds are given simul- 
he Ne. шү choice tests, however, рш 
Slama] A call of their species against the 
ае у calls of the other species to which they 
Pekingo 'ésponsive in the single-call test, the 
Son stent Mallards do show a strong and 
fall i evi Preference for the Mallard maternal 
"ery case (Table 3): 


Cong 
1, 


"Thus, although the birds approach a rather 
wide range of acoustic patterns presented 
singly, they demonstrate a unique preference 
for the particular acoustic pattern provided 
by their own species’ maternal call when it is 
placed in direct (simultaneous) comparison 
with the maternal calls of other species. The 
unique sensitivity or preference which the 
birds have for the maternal call of their own 

5 aled only in simultaneous choice 


ROUND OF THE AUDI- 
AsPECTS OF SPECIES 


: PROBLEMS AND CON- 
SIDERATIONS FOR FUTURE 
RESEARCH 


Upon considering the ducklings’ response to 
the auditory and visual components of a hen 
of their species during the first few days after 
hatching, a number of different lines of evi- 
dence point to the prepotency of the auditory 
component over the visual component during 
this early period. In this instance, “pre- 
potency” means that on any measure the 
maternal call plays a more powerful energizing 
and directive influence than a visual replica 
of the hen. The behavioral evidence for the 
prepotency of the auditory component over 
the visual component conforms rather well 
to what is known about the neuroembryo- 
logical development of the sensory systems, 
as well as the embryo’s and hatchling’s 
sensory and perceptual experience prior to 
its entrance into the testing situation. The 
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latter very possibly has a most significant 
bearing on the present results. Before discuss- 
ing those important factors, let us briefly 
‘review some other behavioral evidence for the 
primacy of the auditory perceptual system 
over the visual system early in avian 
development. 

1. When Peking ducklings have been follow- 
ing one of two similar maternal visual replicas, 
each emitting a different call, and the calls 
are switched between replicas in the midst 
of the test, the ducklings either transfer their 
response to the other replica or completely 
terminate following the first replica (Gottlieb, 
1966): This is a rather complicated result as 
it stands, and it most certainly warrants 
further examination using dissimilar replicas, 
etc., but there is some indication here that in 
the midst of a conventional imprinting or 
training. situation involving an audiovisual 
model that (initially, at least) the ducklings 
are more profoundly influenced by the auditory 
component than they are by the visual com- 
ponent. This conclusion, if verified across a 
broader spectrum of conditions, could have 
important implications for studies of visual 
imprinting in which the training. condition 
involves an audiovisual model and the testing 
condition involves only the visual component. 
In some cases like that, the birds do not 
choose the “imprinted” object in the later test 
(e.g, Klopfer, 1971); that is, thev do not 
actually become imprinted to the visual aspect 
by itself. In an experiment in which the 
procedure was reversed—where the birds were 
trained to an audiovisual object and later 
tested with only the auditory component—-the 
birds did show evidence of auditory imprinting 
in the later test (Evans & Mattson, 1972). All 
of these results taken together suggest that in 
а conventional laboratory imprinting situation 
the birds may be more profoundly affected 
by the auditory component of an audiovisual 
object than by its visual component. This 
Possibility has almost been lost in the tradi- 
tional overemphasis on the 
problem, 

2. So far situ 
duck calls 
been discu 


visual side of the 


ations involving only actual 
and visual replicas of ducks have 
ssed. 


| рге- 
* auditory over {һе visual system 
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is even more clearly in evidence when artificial 
or synthetic stimulation is used. l'or example, 
chicks select auditory flicker over visual flicker 
in a simultaneous auditory versus visual choice 
lest, even when the stimuli have been equated 


for their attractiveness in singular presentations 


(Gottlieb & Simner, 1969), While the previous 
experiments indicate that the duck calls are 
more attractive or compelling to the ducklings 
than the visual replicas of the ducks, the 
present experiment demonstrates the be- 
havioral prepotency of the auditory system 
per se over the visual system per se since, 
in the present case, the evocative nature of 
the test stimuli has been equated. In the 
previous experiments, it could be assumed that 
the calls were merely more evocative than the 
replicas and nothing more. The present 
experiment suggests that the prepotency of 
auditory over visual stimulation in the ap- 
proach and following response of voung duck- 


4 5 amental 
, lings and chicks may represent a fundamenta 


functional difference in the status or role of 
these two sensory systems at this early stage 
of postnatal development, at least unde 
given conditions of prior auditory and visua 
experience, — | Е. 
3. Strong hints of an important functiona 
difference between the auditory and visua 
modalities have also been present in the work 
of others on chicks (Fischer, 1966; Smith & 
Bird, 1963) as well as quail (Porter & Starmen 
1968), even though in those studies no attemp 
was made to assure the initial equivalence 9 
attractiveness of the stimuli employed. M ® 


1 

AE "v ‚ voung 

of these rather different studies, the У es 
ive 


aspect 


birds demonstrated some kind of pos! 
havioral preference for the auditory | 
of an audiovisual complex of stimulati¢ 
the face of so much literature dealing W! 


i i i init ‚ visut 
visual side of imprinting, and the vi e 
i 


jn. 
th the 
| side 
иер 
s ; - (hat n У 
of early experience in general for that 


at 
conclude tha y 
es mo? 


it seems significant to : 
behavior of the young of the bird specie ja ril 
often used in such research is guided ent 
by auditory stimulation, at least with spec 
to the initial phase of development 9 ій 
identification. Let us turn next to а C? ju be 
tion of the ontogenetic factors 


responsible for such a conclusion. 


which € 


Neuroembryclogical and Sensory Stimulative 


E Factors 


When We consider the nature of the earliest 
auditory and visual stimulation. encountered 
by bird embrvos, as well as their neuroembryo- 
logical development, the ontogenetic basis 
for the Carly domination of the visual Буб- 
tem by the auditory system becomes rather 
clear. In the chick embrvo, for example, there 
IS some neuroanatomical and physiological 
evidence that the peripheral auditory system 
May be capable of transmitting impulses at 
around Day 12-13 of development, which is 
Tom 7 to 8 days prior to hatching (evidence 
reviewed by Gottlieb, 1968). By Day 16 or 17, 
` the chick embryo’s auditory system has 

advanced to the point where a rudimentary 
"Ind of auditory conditioning can take place 
Gottlieb, 1968). Its visual system, on the 
Other hand, shows the first evidence of function 
Pupillary reflex) on Day 16-17 (Heaton, 
971), and the first signs of electrical activity 
m the retina are detected around Day 
17-18 (Garcia-Austt & Patetta-Queirolo, 1961; 
>м Clers, Vonderahe, & Powers, 1958) or later 
(Witkovsky, 1963). These and other findings 
is Gottlieb, 1968, for a more complete 
midi" Indicate the relatively early onset of 
Е. ind function and the relatively late onset 
embp a function in the chick from a теш 
ir ^ ogical perspective. (The earlier func- 
АК onset of the auditory system compared 
© the visual system is not an isolated phe- 
aan restricted to birds or chicks; the 
рес] 18 true for a variety of mammalian 
y Ge including humans [literature reviewed 
`, Ottlieb, 1971b].) 
(ges auditory vstem is not only capable of 
i i. 90 earlier than the visual em, but 
unctio morg varied opportunity for such 
ound E Consider, for example, that various 
bot енн can penetrate the egg and that 

S nsu ek and duck embryos can begin 
| Own NS themselves aurally through their 
I 


Mns uq e | 
Ing Сш tons several days prior to hatch- 
ligh Жш & Vandenbergh, 1968). Though 
ор E So Сап penetrate the egg, the embryo’s 
the o tumit 3 


egg eed fora varied visual experience inside 
я Com Arê Or to hatching would seem much less 
шу sti to its opportunity for varied audi- 
mulation, By way of enhancing the 
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functional differences between the two systems 
after hatching, it should be noted that most 
investigators of imprinting keep their hatch- 
lings in visual but not auditory isolation 
during the time prior to experimentation. To 
assess the contribution of isolation (depriva- 
tion) to the present conclusions, it would be 
very informative to reverse those rearing 
conditions and test the birds? auditory and 
visual preferences using equivalent stimuli as 
above. According to a point of view expounded 
in considerable detail elsewhere (Gottlieb, 
1971b), the preference for the visual test 
object should increase under these circum- 
stances, the idea being simply that the visual 
or auditory responsiveness of the birds is 
retarded under conditions of isolation and 
accelerated. under conditions of stimulation. 
In this regard, it is relevant to point out that 
in every one of the previously cited studies 
which indicated some sort of positive prefer- 
ence for, or stronger reaction to, the auditory 
aspect over the visual aspect of an audiovisual 
object, the birds were maintained in visual but 
not auditorv isolation from the time of hatch- 
ing to testing (specifically, Fischer, 1966; 
Gottlieb, 1971a; Gottlieb & Simner, 1969; 
Porter & Stettner, 1968; Smith & Bird, 1963). 

In summary, the relatively late neuro- 
embrvological development of visual sensi- 
tivity compared to auditory sensitivity, com- 
bined with the opportunity for a lesser degree 
or amount of varied visual stimulation com- 
pared to auditory stimulation (both before 
and after hatching under most circumstances), 
would seem to conspire to assure the early 
domination or advancement of auditory 
perception over visual perception early in 
avian development. When one harks back to 
the naturalistic scene (Gottlieb, 1963, 1965), 
it will perhaps be recalled that the earliest 
"social" interchange between the incubating 
parent and the young in the process of hatch- 
ing is an auditory one (ie., a vocal inter- 
change), and one would predict that, under 
these circumstances, the voung might be 
capable of learning the individual peculiarities 
of their maternal parent's call in advance of 
learning her particular visual attributes. (Some 
indirect. support for this proposition stems 
from the observation that guillemots, Uria 
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aalge, learn to distinguish the individual calls 
of their parents before they hatch [Tschanz, 
1968]. So far we have only weak evidence of 
the capacity for individual auditorv recogni- 
tion by young ducks [ Ramsay, 1951], but it 
has been demonstrated very clearly in other 
species [ Beer, 1969; Evans, 1970].) Second, 
under the usual conditions of incubation, 
hatching, and brooding in nature, and the 
usual conditions of incubation, hatching, and 
brooding in the laboratory, it may be that the 
learning of the visual attributes of the hen 
(or an artificial visual surrogate) is facilitated 
or accomplished in the context of exposure 
to her call. (This proposition receives indirect 
support from the fact that many experimenters 
on visual imprinting find it “useful” to have 
an auditory signal accompany the to-be- 
imprinted visual object during the initial 
learning or exposure period.) These predictions 
are closely tied to the prior neuroembryological 
history and usual sensory stimulative back- 
ground of the hatchling. Thus, for example, as 
far as the latter is concerned, it is important to 
recall that the usual course of auditory percep- 
tual development in ducklings is itself altered 
when merely quantitative departures from 
normally occurring auditory stimulation are 
introduced during embryonic (Gottlieb, 197 1a) 
or even postnatal phases (Gottlieb, 1966). To 
be specific, augmentation or premature ad- 
ministration of normally occurring auditory 
stimulation accelerates auditory perceptual de- 
velopment. in embryos as well as hatchlings, 
while deprivation has the reverse effects on 
both embryos and hatchlings. A recent study 
by Reese, Schotté, Bechtold, and Cowley 
(1972) demonstrates the enormous influence of 
rearing condition on later choice behavior in 
young chicks. 


Specific and Nonspecific Developmental Back- 
ground Factors 


An imposing and informative task for the 
future is the disentanglement of specific from 
nonspecific sensory stimulative background 
factors in the development of perception. 
At the moment, it cannot be said for certain 
Whether prior auditory and visual stimulation 
merely play a rather nonspecific role in the 
ые maturation and development of 

ОТУ systems, or whether specific 
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features of the prior stimulation also determine 
or influence the rather specific perceptual 
preferences that are evident later on in 
development (for example, the preference for 
the maternal call of the species, the preference 
for certain repetition rates, intensity, cte). 

In order to accomplish such a genuine 
advance in understanding the development of 
perception, it is necessary (а) to analyze the 
particular attributes of the stimulating com- 
plex to which our experimental subjects are 
responsive (rate, frequency, modulation, wave- 
length, internal configuration, intensity, ete), 
whether the stimulating complex be auditory 
or visual and, with this information in hand, 
(b) to try to make a highly specific link with 
Some particular aspect of the bird's prlor 
sensory experience. This challenging task 15 
complicated somewhat by the interesting buk 
as yet unexplored possibility that ѕресійс 
features (e.g., rate) of stimulation in one 
sensory modality may influence the develop- 
ment of preferences in another modality (55 
Simner, 1969). 

Since the sensory systems become functional 
sequentially rather than all at once (Gottlieb, 
1971b), there is some ontogenetic basis 107 
Schneirla’s (1965) hypothesis that certain 
features of embryonic tactile and proprio" 
ceptive stimulation may influence preference? 
in the later-developing auditory and visu? 
modalities. These kinds of intersensory experi 
ments in birds, which would represent pb 
advanced stage of inquiry, require a pace 
analytic understanding of the key attribut ; 
of the stimulating complex as a first NU 
otherwise, there is no basis to establish ee 
with prior experience in another sensory iry 
dality. Of course, the first stage of no 
would attempt to establish develop ory 
links within rather than between seP ^ 
modalities. | ple 

А more general way to pose this pre s 
to ask: Why are certain "training" objec В 
much more effective than other stin nel 

= ie 8 ; ontoge 
complexes? That is, why, i" vm resp E 
sense, do young chicks or йаш ПЕ pl 
more readily to certain stimulating atio? 

) ; - аі present im 
rather than others upon initial p ‘cc. 
This question—so provocative «m devel? n 
portant for our understanding of t ad mut 
ment of perception—-has been Vf 
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SPECIES IDENTIFICATION IN BIRDS 


neglected in the experimental analysis of 
Imprinting. 


SUMMARY AND C “ONCLUSIONS 


The most effective object for eliciting the 
approach or following response from “naive” 
ducklings is one which emits a recording of 
the call of the hen of the species as she leads 
the young from the nest in nature, even 
though the young have not been previously 
exposed to the hen's call. If an object emits 
the Species-specific maternal call, its visual 
characteristics can vary widely with respect 
to size, color, and marking without signifi- 
cantly reducing its attractiveness to the 
ducklings. If another call is introduced, one 
With different audiometric characteristics than 
the species’ maternal call, all. visual objects 
tested become significantly less attractive. 

As the results of auditory choice tests show, 
the birds come into the imprinting situation 
already “tuned” to the distinctive audiometric 
Characteristics of the maternal assembly call 
е their own species; thev selectively respond 
din e preference to other calls which have 
not ta audiometric characteristic ШЕ is 
replica case, however, with a purely visual 
om of a hen of the species, nor even an 

Priately vocalizing replica of the hen— 

де ducklings are not selectively responsive 
E. visual replica of a hen of their own species, 
call, quien it is emitting the species’ maternal 
i: E hus, species identification in ducklings 
and ‘ally founded on auditory perception 
Char, m nature, the learning of the visual 
Coni istics of the hen takes place in the 
of ( Xt of the duckling’s response to the call 

he hen, 

Audi а neuroembryological perspective, the 
develo y System of birds (and other animals) 
Which yj : advance of their visual system, 
Visual ee toa behavioral domination of the 
Carly aros by the auditory system, at least 
Drog ^ evelopment. While neural maturation 
Onset E involved- that is, a different 
stems үр of maturation of the two sensory 
i ау also e Sensory stimulative environment 
e Contribute to the early disparity 


etw. 
s ееп 5, " $ 
impp ^7 2Uditory and visual function. In most 


, Drin * . 
Ate ng Studies, for example, the hatchlings 
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"ined in visual but not auditory 
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isolation prior to their entrance into the 
imprinting situation. 

The initial, so-called naive perceptual prefer- 
ences of animals have long been overlooked 
as a fruitful starting point for developmental 
study; this is nowhere more evident than in the 
literature on avian imprinting where there is, 
in fact, the least justification for such an 
oversight. Namely, there is now considerable 
evidence to show that the auditory and visual 
systems of chicks, ducklings, and other avian 
species are functional prior to hatching, and 
stimulation during the embryonic period is 
known to affect the behavior of the embryo 
as well as that of the hatchling, so attribution 
of perceptual naiveté to young birds which 
have been communally incubated and brooded 
in noisy and illuminated surroundings hardly 
seems reasonable. In fact, the failure to closely 
control or monitor the sheer amount and kind 
of preexperimental auditory or visual stimu- 
lation, both before and after hatching, ignores 
the demonstrated effects of these variables on 
behavior in imprinting-type situations (e.g., 
Bateson, 1964; Dimond, 1968; Gottlieb, 1966, 
& 1971a; Gottlieb & Klopfer, 1962; Grier, 
Counter, & Shearer, 1967; Haywood, 1965; 
Impekoven, 1973. A fully developmental 
understanding of behavior needs to take into 
account the sensory environment and per- 
ceptual capabilities of the embryo as well as 
the hatchling. 
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NONNECESSARY CONDITIONS OR USEFUL 
PROCEDURES IN DESENSITIZATION: 


A REPLY TO WILKINS 


WESLEY G. MORGAN ! 


University of Tennessee 


Muscle relaxation, a graded hierarchy of fear-relevant scenes and imagination 
of these scenes concommitant to muscle relaxation are three procedures defining 
Wolpe's Systematic Desensitization which Wilkins claims are unnecessary. 
These claims follow from a somewhat prejudiced review of largely analogue 


studies and may be misle: 


ing to the practicing behavior therapist. Explana- 


tions of the success of desensitization in terms of cognitive and social variables 
seem to have no more support or utility than Wolpe’s original theoretical 
formulations, It would seem premature to discard the defining procedures of 
desensitization as unnecessary when they are quite useful and perhaps neces- 
sary for success in treating highly fearful patients. 


on Behavior therapists may be justly troubled 
by gna A theoretical, and practical grounds 
Wilkie conclusions and suggestions offered by 
Si (1971). Wilkins states that the ef- 

Veness of the systematic desensitization 


ack : : 
Package (which he does not seem to dispute) 
Seems 


Б " 
CA social variables involved in the patient- 
ing ox ы relationship and cognitive variables involv- 
Басе ap Anc of therapeutic gain, information feed- 
and vic Success training in the control of attention, 
havior ОЧА learning of the contingencies of be- 
rough instructed imagination [p. 3111. 


pits View has the unfortunate effect of mini- 
ing extremely important procedures and 
Perations while substituting for them con- 
Pts which are poorly defined operationally 


ex lor which comparatively little support 
Xists. 


Necrssrry or MUSCLE RELAXATION 


de йв (1971) is correct in asserting that 
i E relaxation is not a necessary con- 
One а 9F desensitization” to take place if 
E to mean by “desensitization” a 
deg, On in fear behavior rather than the 
Ing operations of Wolpe's procedure. 


° a 
Cray YEr, it is important to point out the sev- 


Se i . а Е 
~— "ses in which muscle relaxation is not 
1 har cae. 
E Masts for reprints should be sent to Wesley 
t olore Psychological Clinic, Department of 
n Essee jp Diversity of Tennessee, Knoxville, 


necessary. (a) It is not necessary in the 
sense that other anxiety-inhibiting responses 
or conditions may be used as well. In fact, 
any condition or response which is incom- 
patible with the experience of anxiety would 
be acceptable for use in the countercondi- 
tioning procedure as Wolpe has outlined it. 
For example, Wolpe and Lazarus (1966) re- 
port the use of emotive imagery, feeding re- 
sponses, conditioned motor responses, and 
sexual responses as anxiety-inhibiting opera- 
tions, and there is no reason to believe that 
this exhausts the possibilities. (5) It is not 
necessary in the sense that one could become 
desensitized to a stimulus through simple ex- 
tinction (omitting the anxiety-inhibiting op- 
erations) as well as through countercondi- 
tioning procedures (Malleson, 1959; Stampfl 
& Levis, 1967; Wolpin & Raines, 1966). How- 
ever, the practicality and comparative utility 
of these other techniques for severely phobic 
patients still needs support. And (c), not in- 
dependent of û above, relaxation may not be 
a necessary condition for desensitization to 
take place with subjects demonstrating a 
weak or mild anxiety as opposed to a severe 
anxiety. 

In contrast to papers cited by Wilkins 
(1971), other studies with human subjects 
have shown facilitative effects of relaxation 
in the reduction of fears of nonclinical pro- 
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portions. (Davison [1965], 1968; Rachman, 
1965). Cooke (1966) has shown that subjects 
with higher Bendig Emotionality Scores im- 
proved more when their treatment employed 
relaxation than when it did not, and Schubot x 
was also able to demonstrate that relaxation 
was very important in modifying the fears of 
highly fearful subjects while it did not fa- 
cilitate the reduction of fear in moderately 
fearful subjects, Thus the importance of re- 
laxation may be positively correlated with the 
intensity of the original fear and would be of 
relatively greater importance to the clinical 


practitioner than to the analogue researcher 
or theoretician. 


NECESSITY OF A GRADED HIERARCHY or 
Frar-RELEVANT SCENES 

Wilkins (1971) has claimed that. individ- 
ually tailored hierarchies of feared scenes are 
unnecessary for successful desensitization. The 
experimental studies cited in support of this 
claim have been analogue studies, and one 
must consider how relevant these studies have 
been before accepting this conclusion. 

All the analogue studies have used spatial- 
temporal hierarchies (Paul, 1969) while those 
problems encountered in applied clinical situa- 
tions are much more likely to involve thematic 
hierarchies, (Hain, Butcher, & Stevenson, 
1966; Marquis & Morgan, 1968; Wolpe, 
1961). With these more complex kinds of 
hierarchies numerous graded scenes ar 
likely to be required. There is also reasonable 
doubt that the participating subjects actually 
Were as severely fearful as one would likely 
Obtain in a clinical case, As Ber 
Paul (1971) have noted, “To the extent that 
Strong anxiety responses are not elicited, the 
need for lower hierarchy items at all, let alone 
their order, duration, manner of presentation, 
and so forth, becomes theoretically irrelevant 


е more 


nstein and 


[p. 233]." 
c — 

*G. C. Davison. The influence of systematic de- 
Sensitization, relaxation, and graded exposure to 
Imaginal aversive stimuli on the modification of 
Phobic behavior Unpublished doctoral dissertation 
Stanford University, 1965. 
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IMAGINATION ОЕ FEAR-RELEVANT SCENES 


Wilkins (1971) states, "Indeed, the only 
necessary condition appears to be imagination 
of fear-relevant scenes not necessarily T 
ranged into a hierarchy and not necessarily 
concommitant to muscle relaxation Ip. 311]. 
If one were to seriously take this position, it 
would be encumbant on him to explain why 
patients with nightmares and obsessions e 
cerning feared situations are not always “spo! З 
taneously” cured, Surely there must be othe 
factors of influence. Wilkins suggests three 
other factors which he claims operate in the 
desensitization situation: (a) expectancy O 
therapeutic gain, (b) the therapist as a e 
forcer of nonfear behavior, and (c) the ro e 
of information feedback, all of which he claims 
are highly influential in affecting outcome. 


EXPECTANCY Or THERAPEUTIC GAIN 


Wilkins (1971) argued that the expectancy 
of therapeutic gain contributed “а large pa "1 
to the actual gain obtained in systematic de 
sensitization. Since then, in a more carefully 
prepared critique of this area, Wilkins (1973) 
concludes, “that the construct ‘expectancy 0 
therapeutic gain’ emerged prematurely an 
without the empirical Support necessary : 
establish its validity." It is evident fron 
studies cited in this more recent critique wi 
others (Borkovec, 1972; Davison (see Pos 
note 1); Lang, Lazovik, & Reynolds, pu 
McGlynn, 1971) that while one cannot -— 
that expectancy plays no role in the outco ii 
of desensitization (one cannot affirm the pe 
hypothesis), one can conclude that oe in 
proved extremely difficult to demonstra ne 
the context of systematic desensitization, ó at 
that it most probably plays a minimal Pus 
best in the achievement of obtained results 


БАК 
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THERAPIST AS A REINFORCER OF NO 
BEHAVIOR AND THE ROLE OF 
INFORMATION FEEDBACK 


sn the 

А ith 
Most would not wish to yes cement: 
effectiveness of contingent reinfo or 


whether in the form of social reinforce p. 
information feedback, for increasing = yio” 
ability of adaptive, appropriate awe 
These operant techniques can, ani ness 
do, add significantly to the effective 
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efficiency of the desensitization package. But, 
in the absence of direct demonstrations, to 
claim that these are “critical variables" for 
Successful therapeutic outcome in desensitiza- 
tion seems to go considerably beyond the data 
available, and to suggest that the major op- 
erations of Wolpe's systematic desensitization 
Procedure are not useful in the actual treat- 
ment of severely fearful patients would be 
quite premature. 
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DESENSITIZATION: 
A REJOINDER TO MORGAN 
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University of Miami 


Specific points raised by Morgan are debatable on empirical and logical 
grounds; however, the distinction between necessary and useful operations 


provides a meaningful framework for the advancement of 


theory, practice, and research. 


The major controversy involving systematic 
desensitization does not concern its effective- 
ness (Paul, 1969a, 1969b), rather the mecha- 
nisms responsible for fear reduction, By dis- 
tinguishing between necessary and useful ele- 
ments of Wolpe’s (1958) procedure, Morgan 


(1973) has facilitated the search for basic 
mechanisms, 


NECESSARY OPERATIONS 


To meet the defining operation of pairing 
imagination with relaxation, three types of 
instructions are used: inducing muscle relaxa- 
tion, directing attention toward scenes, and 
shifting attention from scenes. Since, as 
Morgan (1973) agrees, muscle relaxation is 
not necessary for fear reduction, instructions 
directing attention toward and from fear-re- 
lated scenes remain the only necessary opera- 
tions. This necessity of scene imagination 
does not logically mean, as Morgan (1973) 
implies is Wilkins’ (1971) position, that 
imagination is also swficient for fear reduc- 
tion. 


USEFUL OPERATIONS 


Morgan (1973) states that muscle relaxa- 
tion is not necessary for fear reduction when 
simple extinction is used. However, in the 
articles cited as using “simple extinction,” not 
only are anxiety-inhibiting operations omitted 
but anxiety-maximizing operations are inten- 
tionally included. As such, Morgan’s (1973) 


1The author wishes to thank Wesley Morgan 
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Preparation of this paper. 
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desensitization 


simple extinction argument becomes an unin- 
formative tautology: Procedures which in- 
hibit anxiety are not necessary if procedures 
which elicit anxiety are used. А 

If muscle relaxation is necessary in treating 
severe anxiety (as suggested by Morgan 
1973), the implosion procedure should not be 
effective with severe anxiety. However, since 
reports by Hogan (1966, 1968) attest to 118 
effectiveness іп treating clinical cases, E. 
structed muscle relaxation does not appear 
necessary even in treating severe anxiety. — 

While neither feeding, motor, sexual, nO" 
relaxation responses (Morgan, 1973) are pe 
essary for fear reduction, each is useful ant 
each may function as a tactical maneuver 
(Davison & Wilson, 1972) by providing 2 


; = Я : 15 
direction for attention when attention 
shifted from imaginal scenes. 
Directions ror RESEARCH 
dure 


The possibility emerges that any procedt à 
Which contains instructions to shift atten 
toward and from symptom-related scenes ad 
facilitate symptom reduction. However, C 
instructed attention shifts may not be E Ж 
cient for improvement, а delineation of the а 
ditional necessary and useful operations 5 in 
rounding instructed attention shifts 15 
order. nob 

Morgan (1973) calls for the validation д. 
results of analog research on clinical par 25 
tions апа Раш (1969) calls for the оро 
tion of therapist variables іп дееп? which 
It appears important to identify pn are 
conditions and for which populate by 
which procedures most useful. Stu 911), 
Evans (1968), Farmer and Wright ne pat 
Grim (1971), and Lang, Melamed, ^. 
(1970) exemplify this research direc 
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GAMBLE'S REVIEW OF THE HOLTZMAN INKBLOT TECHNIQUE: 
CORRECTIONS AND CLARIFICATIONS 


JON D. SWARTZ: 


University of Texas at. Austin 


Certain statements by Gamble regarding results obtained with the Holtzman  - 


Inkblot Technique (HIT), 
technique, 


and the relationship of the HIT to the Rorschach 
are presented and corrected. 


. Specifically, previous results obtained 


for the HIT Pathognomic Verbalization (V) score are clarified; the distinctions 


between Rorschach and H 
l4 
Gamble (1972) has provided a generally 
good, basic review of the Holtzman Inkblot 
Technique (HIT). Given the unique status of 
the Psychological Bulletin in psychology, his 
review probably will influence those working 
with the HIT for many years to come; unfor- 
tunately, several of his statements either are 
in error or in need of some clarification. 

In an evaluative statement concerning re- 
sults obtained for the HIT Pathognomic Ver- 
balization (V) score in two studies of develop- 
mental level carried out by Thorpe and Swartz 
(1965, 1966), Gamble reported that “V first 
declined and then rose again” in the first 
study and that “at variance with the earlier 
data, V was found to decrease with age |p. 
179]” in the follow-up study. In point of fact, 
V scores declined from a mean of 11.4 at age 
5 years to 5.6 at ages 8-9, to a low of 2.6 at 
ages 12-14, and then rose to mean scores of 
6.6 and 6.1 at ages 16 and 18-19 years, re- 
spectively, in the initial study; in the fol- 
low-up investigation mean V scores declined 
from a mean of 7.4 at age 6.7 years to 5.1 at 
age 9.7 to a low of 2.3 at age 12.7 years, the 
oldest criterion group in the follow-up study. 
A low score of 2.6 at age 12.7 years in the ini- 
tial study is hardly “at variance” with a low 
Score of 2.3 at age 12.7 years in the follow-up 
Study. In addition, in а cross-cultural valida- 
Поп of the results of these first two studies 
of HIT perceptual development scores, it was 


1 Requests for reprints should be 


Hogg Building, 
ехаѕ at Austin, Austin, Texas 78 


sent to Jon D. 
the University of 
712. The author 


IT Human, Movement, 
sented. Some recent extensions of HIT research a 
other minor errors are pointed out and rectified, 


and Color scores are pre- 
re discussed, Finally, two 


found that 100 Mexican school children, ro 
at age 12.7 years, had a mean V score of In 
(Swartz, Lara Tapia, & Thorpe, 1967). t 
all HIT studies of normal subjects carried оп 
to date, V scores decline dramatically bie 
age until around 12-13 years, at which ei 
they rise again. The qualitative nature of E 
categories of Pathognomic Verbalization be 
strikingly different in V scores produced n 
puberty, however, With increasing age 10 E 
been found that there is a significant incre 
in the number of children giving fabul a 
responses and significant decreases 2n 
number giving autistic logic, contaminate® 
and absurd responses.? elop- 

In concluding his discussion of ee 
mental studies using the HIT, Gamble (1 
suggests that a next logical step 


e 
MES old a£ 
would involve testing subjects in middle and " 


to determine whether the HIT is sensitive to 
verse trends of decreasing differentiation an 
archic integration predicted by organismic 
mental theory [p. 180]. 


hier- 
eveloP~ 


F vex has 
The study of middle-aged subjects alrea Da e 
been carried out (Witzke, Swartz, ge 


z " 0 о 
1971), and the study of subjects in re 


^c ition 
currently is in progress? In m d Pes 
cent investigation of the relations pt pat 
mental age and the several ИГУ нр ul n 
have proved to be reliable and m t also r 


dexes of perceptual developmen 


r 
sa nO 
n 
ions 1 oral 
atio actor 


NA 
dy 


2], D. Swartz. Pathognomic uu 
mals, psychotics, and mental zen B ustin) 
dissertation, University of Texas BS 1969 
Arbor, Mich.: University Micro 
70-10, 872. Я 
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vided strong support for these HIT variables 
às indexes of perceptual development and con- 
firmed the nature and the direction of this 
development (Swartz, Cleland, Drew, & 
Witzke, 1971). 

The clarifications that need to be made re- 
garding the relationship of the HIT with the 
Rorschach involve the HIT scores of Move- 
ment, Human, and Color: 

1. Gamble (1972) states that “Rorschach 
Human Movement (M) is | HIT] Movement 
Scores 2 or higher [p. 175]." In fact, Ror- 
Schach Human Movement could be HIT 
Movement 1, 2, 3, or 4, depending upon the 
Tesponse. In the HIT scoring system, a Move- 
Tent score of 1 is reserved for those responses 
With “static potential for movement" (sitting, 
looking, etc.) (Holtzman, Thorpe, Swartz, & 

erron, 1961, p. 51). According to at least 
9ne Rorschach authority, such potential would 

© scored Human Movement since “human 
Movement is used to score concepts involving 
any kind of humanlike action, posture, or ex- 
Pression | Klopfer & Davidson, 1962, p. 70].” 
Related to Point 1, Gamble then states 
that “Rorschach Entire Human (И) is 
[HIT] Human 1 or 2 [p. 175].” This state- 
ment also is in error, Rorschach Entire Hu- 
man would most often be HIT Human 2 
While Rorschach Parts of Human Figures 
(Ha) would most often be HIT Human 1. 
Tost other Rorschach content scores related 
to human content [ (1), AH] also would be 
Scored HIT Human 1. 

3. When Gamble writes further that “Ror- 
Schach FC, CF, or C would correspond to the 
number of [HIT] chromatic (italics added) 
“ards coded 1, 2, or 3, respectively, for Color 
Р. 175],” he is also at least partly in error. 
n the HIT scoring system no distinction is 
iade between chromatic and achromatic cards 
n Scoring Color. It is quite possible, there- 
°re, for HIT Color scores to be given to com- 
d achromatic cards. It is true that Ror- 
lc Ach FC, CF, or C probably would cor- 
ino od very closely to the number of re- 
ома to chromatic areas of HIT cards 

ed 1,2, or 3, respectively, however. 
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4. Finally, two minor errors of a typo- 
graphical nature might lead to some confusion 
if not corrected: (a) In listing the nine qual- 
itative Pathognomic Verbalization (V) cate- 
gories in the HIT Scoring system, Gamble 
(1972) refers to “deteriorated responses [p. 
174]." Most V scores, regardless of qualita- 
tive category, can be thought of as “deterio- 
rated." Gamble intended to write “deteri- 
orated color" responses. (b) When Gamble 
writes “Rorschach Whole (W), Determinants 
(italics added) (D), or Common Detail (d) 
would be [HIT] Location 0, 1, or 2, respec- 
tively [p. 175]," he do doubt meant to write 
"Large Usual Detail" instead of *Determi- 
nants.” 
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PROBLEMS IN 


THE MEASUREMENT OF COGNITIVE DEFICIT". 


LOREN J. CHAPMAN? ахо JEAN P. CHAPMAN 


University оў Wisconsin 


Several dozen studies oí differential cognitive deficit appear each усаг. Artifacts 
cloud the findings in a majority oí these studies because of differences in dis- 
criminating power of tasks coupled with generalized deficit of the patients. 
With two tests of differing reliability, the test with the higher reliability will 


yield greater performance deficit for the less able subjects. We 


argue for the 


advisability, in studies of differential cognitive deficit, of matching tasks on 


reliabili 
the distribution of item difficulty, 
ization group. 


J 


Most hypotheses about cognitive deficit are 
tested in designs in which two or more error 
scores are compared. We show that in such 
studies artifactual findings are produced by 
differences in test characteristics which affect 
discriminating power. We illustrate these 
principles by studies of schizophrenic thought 
disorder, although the principles apply 
equally well to hypotheses about all other 
cognitive pathologies such as those of brain 
damage, drug states, and mental deficiency. 

Most investigations of cognitive deficit are 
concerned with specific deficits. A single score 
per subject is not sufficient to measure a spe- 
cific deficit because subjects with cognitive 
pathology typically show generalized cog- 
nitive deficit, that is, low scores on almost 
any measure of intellectual functioning. 
Therefore, meaningful statements about spe- 
cific deficit must be in terms of differential 
а 
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ү, shape of the distribution of scores, and mean, variance 


and shape of 


using normal subjects alone as a standard- 


deficit, that is, a greater loss in one ability 
than in one or more other abilities. 


DISCRIMINATING POWER AND MEASURES 
OF DIFFERENTIAL DEFICIT 


The degree of differential deficit in ability 
is inferred, of course, from a comparison 9 
the extent to which two measures discrimi- 
nate pathological performance from pur 
performance. However, the extent of the E" 
feriority of score of pathological subjects 
pends not only on their deficit in ability bY 
also on the discriminating power of the test 
"Discriminating power" refers here to (he 
extent to which the score differentiates 
more able from the less able subjects n 
hence, differentiates two groups that D^ , 
the ability measured by the test. Tests 3 
identical discriminating power if they ue 
identical distributions of error-free - 
scores, If deficient subjects are as m e 
normal subjects on one ability as on ao the 
but the test that is used to measure one е test 
abilities is more discriminating than t ficit 
of the other, a greater performance inatin 
will be found on the more discrim! f 
measure. е 0 

For tests of the same format and er di- 
scoring (e.g., free response ee pow 
chotomous scoring), the discriminat! | culty 
of a test is a function of mean item item eA 
dispersion of item diffculty, E йг, 
variance, and number of items. va gue 
on item difficulty. In many pue tasks wa 
of cognitive pathology the pairs 9 ways wht 
differ sharply on item difficulty 
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could easily have produced the results of the 
Study artifactually. 

T Other things being equal, items closer to the 
50% level of difficulty for all subjects tested 
have the highest discriminating power on a 
dichotomously scored free response task com- 
Posed of items on which guessing is not likely 
to produce the correct answer (Lord, 1952b). 
On multiple-choice tasks, the highest dis- 
Criminating power is expected for accuracy 
levels slightly higher than midway between 
chance and 100% accuracy (Lord, 1952a). 
_ The effect of item difficulty on discrimina- 
lion of groups of differing ability can be il- 
lustrated by comparing two age groups of 
children on vocabulary items that differ in 
difficulty, Figure 1 shows the percentage ac- 
curacy of a group of 34 third-grade children 
and a group of 39 fifth-grade children on the 
Vocabulary items of the Stanford-Binet In- 
telligence Scale, The abscissa of Figure 1 
Presents six sequential groups of vocabulary 
items, each representing a different level of 
difficulty (as measured by percentage ac- 
Curacy for third and fifth grades combined). 
©. Each Point on the abscissa represents several 
| Vocabulary items of a given level of difficulty. 


Sainst this abscissa are plotted curves for 
be Percentage of third-grade children and the 
Set centage of fifth-grade children passing each 

~ Of items, (If the two grades were com- 
ыле, the resultant curve would represent a 
s dle plotted against itself, except for the 
seis tly different scale conventions of the ab- 

55a and ordinate.) The curves for the two 
Brades tend to join at the extreme ends of the 
mension of difficulty, and they attain great- 
ey Paration for items at about the 50% 
ined п difficulty for the two groups com- 
disc; This relationship between difficulty and 
ny ve AR power may be generalized to 
chiding groups that differ in accuracy, in- 
ne s pathological and normal subjects. 
be E illustrate how differences in difficulty 
exami, mistaken for differential deficit, let us 
Cott ne а study by Truscott (1970). Trus- 
je, Pared schizophrenic and normal sub- 
cipe four tasks, including recall of random 
h D. Strings anq recall of normal sentences. 
} Wong 'UDS differed less on recall of random 


rd 
8 
trings than on recall of normal sen- 
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Fic. 1. Accuracy of third- and fifth-grade children 
on Stanford-Binet vocabulary items of gradated dií- 
ficulty. 


tences, Truscott concluded that schizophrenics 
show a lack of facilitation of recall by 
normal syntactic constraints. 

However, her results may instead have been 
at least in part an artifact of the levels of 
difficulty (and, hence, discriminating power) 
of her two tasks. Truscott's tasks were very 
difficult (corresponding to the left-hand side 
of Figure 1). Recall of random word strings 
was at about 20° accuracy for the two 
groups combined and recall of sentences was 
much closer to 50% accuracy. If Truscott had 
used shorter word sequences for both tasks, 
she would presumably have sampled items at 
the easy end of the difficulty dimension (cor- 
responding to the right-hand side of Figure 
1). Because random word strings are often 
more difficult than sentences, they would then 
have been closer than sentences to the middle 
range of difficulty. Hence, Truscott would 
probably have found the opposite result, that 
is, a greater deficit in recalling random word 
strings than in recalling sentences. 

In many studies one cannot judge the direc- 
tion of the probable artifact because it is not 
clear which task is more discriminating for 
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normal subjects. If the items of both tasks 
are very easy, most normal subjects make 
almost no errors on either task, but a less able 
group that makes more errors might find one 
task more difficult than the other. Still other 
studies use tasks for which adequacy of per- 
formance is defined by criteria other than 
number of errors, such as by speed of re- 
sponse, as in reaction time tasks, or trials to 
criterion, as in verbal learning. If the tasks 
are not matched on these criteria of difficulty, 
the direction of the probable artifact is am- 
‘biguous because the most discriminating level 
of performance is unknown. One may never- 


theless be sure that differences in difficulty 
pie serena discrimination of groups. 


How то MATCH Tests on 
DISCRIMINATING POWER 


In conventional psychometric theory, re- 
liability is the index of discriminating power 
in that the square root of the reliability is the 
correlation of obtained scores with error-free 
true scores. Yet equivalent reliability is not 
sufficient by itself for matching tests in studies 
of differential deficit because of differences in 
general ability level between the standardiza- 
tion sample and the experimental subjects. 
The rationale follows. 

The proper standardization sample consists 
of normal subjects only. Tests differ in dis- 
criminating power to the extent that they 
distinguish the more able from the less able 
subjects among all subjects tested, that is, 
the normal and pathological subjects treated 
as a single group. Yet, one would not wish to 
match tasks for the two groups combined be- 
Cause reliability of the tests is affected by any 
true differential deficit due to pathology. The 
task with the greater schizophrenic deficit will 
have the larger variance for the two groups 
combined, and hence, the higher reliability. 
Matching on reliability for the two groups 
combined would eliminate some or all of this 
evidence of differential deficit in ability. 

Tasks should instead be matched using nor- 
a Subjects of a wide range of ability. If the 

S are developed before the experiment 
ae the &roups in the experiment will prob- 
Maar diferent range of ability than the 

zation sample. As a result, mean item 
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difficulty of the tests will be different than in 
the standardization sample, so that tests that 
had been matched only on reliability and not 
on item difficulty would probably become un- 
matched on reliability for the new groups. 
However, if tests are also initially matched on 
item difficulty, they should usually remain 
matched on reliability for a normal group of 
a different mean ability level. The match 
should be on mean, variance, and shape of the 
distribution of item difficulty. Comparable 
shapes of the distributions of test scores are 
also necessary if scores from the two tests are 
to be meaningfully compared. The matching 
of the two tests using normal subjects of à 
wide range of ability will usually result E 
the two tests retaining equivalent reliabilities 
for normal subgroups of different levels i 
ability. This equivalence must be verified к 
any particular group of pathological ао 
by recomputing test characteristics e * 
normal sample which scores at the leve cé 
the pathological group. With such equivale 
any difference between the two test scores 


f to 
pathological subjects can be attributed 
genuine differential deficit in ability. pové 


Despite fairly close matching on the А 
variables, tests may differ slightly оп mean ү 
variance for normal subjects. Tests тау Ws 
differ slightly on number of items. Tf, in pad 
cases, the two distributions of scores wert 
similar shapes, one might reasonably pc 
all scores to standard scores on the be iza- 
the performance of the normal stan or dif 
tion sample. This conversion corrects ari- 
ferences between the tests in mean an 
ance and facilitates comparison. (It © a 
of course, correct for differences betwe 
in discriminating power.) 


ot; 
oes 2 
1 tests 


ILLUSTRATIVE EXAMPLE 


ing 
А no 
crimina. 


The need for equivalent dis re 
power may be illustrated by rt 
cently gathered by the present yi ^it Р 
item multiple-choice analogies os й 

of gradated difficulty was given ^ normal 
disturbed schizophrenics and 20 pe 
jects. This test was used as а Р ; 
from which various subsets PR and ! 
drawn to compare schizophrer > pa 
performance for purposes 0 


а 
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114 items were arbitrarily divided to yield 
two forms. Form A consisted of the 62 items 
for which the first word of the analogy began 
with a letter that occurs in the alphabet be- 
tween A and L. Form M consisted of the re- 
Maining 52 items for which the first letter of 
the word fell between M and Z. Normal sub- 
Jects scored about the same on the two forms, 
and the schizophrenics showed about the same 
Performance deficit on one form as on the 
Other, 

The effects of difference in reliability on 
Performance deficit may be illustrated by 
drawing subtests from the two forms with 
identical means but different reliabilities for 
norma] subjects. Coefficient alpha (Kuder- 
Richardson Formula 20) was our measure of 
reliability. As shown in Table l, a pair of 
Subtests was drawn with a coefficient alpha of 
80, One subtest from each form. A second 
Pair of subtests was drawn with lower coef- 
ficient alpha values, .47 for the subtest from 
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Form А and .49 for the subtest from Form M. 
These four subtests were closely matched on 
vow accuracy, The more reliable subtests 
fe Y» lower variances of item difficulty and 
pedis ao test variances, as might be ex- 
the a 4 Table 1 shows the mean accuracy of 
he "Zophrenic and normal subjects for 
т Subtest. The schizophrenics showed, as 
the cted, a greater preformance deficit on 
Subtests with the higher reliabilities, 
Whether from Form A or M. Thus, a com- 
Parison of the schizophrenic performance on 
ая less reliable Form М subtest with the 
ore reliable Form A subtest yields poorer 
Schizophrenic accuracy on Form A than on 
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Form M (¢ = 4.89, p < .001). On the other 
hand, if one should choose the more reliable 
Form M subtest and the /ess reliable Form A 
subtest, one would obtain the opposite find- 
ing, with the schizophrenics scoring lower on 
Form M than on Form A (t= 432, p< 
.001). These results show how easily differ- 
ences in reliability may artifactually vield an 
apparent differential deficit in ability. If the 
two forms were designed to measure different 
abilities, as in the usual study of schizophrenic 
cognition, the investigator could easily con- 
clude from performance on either one of 
these pairs of subtests that he had demon- 
strated a differential deficit in ability. 

Let us now examine data that show how 
differential difficulty of items can produce dif- 
ferences in discriminating power of tests de- 
spite equivalent reliabilities (both difficulty 
and reliability again measured by normal per- 
formance alone). These data are from a free 
response vocabulary test that was given to 52 
schizophrenics and 56 normal control subjects, 
some of the items being affect laden (emo- 
tional) and others being affectively neutral. 
Table 2 shows the performance of schizo- 
phrenic and normal subjects on two 10-item 
emotional and two 10-item neutral subtests 
with the coefficient alphas around .70. Two 
subtests, one emotional and one neutral, were 
chosen from items of medium difficulty and 
two others, one emotional and one neutral, 
from very difficult items. The items of medium 
difficulty were of about 65% accuracy for 
the normal subjects alone and about 5196 
for the two groups combined. The very diffi- 
cult items were of less than 1096 accuracy for 


TABLE 1 


RELATIONSHIP OF 


SCHIZOPHRENICS’ PERFORMANCE DEFICIT TO TEST RELIABILITY 


| 


Е M n8 Рае | Accuracy 
| | zi 
Subtest Suntec? | Gee | yarns | Normals Schizophrenics 
| ` items alpha | ОНУ 
| | 
| | | | Y | 5р x | SD 
High reliability | | | | 
з ability 5 | 
"orm A 3 | 10 | .80 | 02 | 6.3 | 28 | 30 | 28 
iam orm M 10 | 80 03 6.4 | 27 | 80 2.6 
W reliability — | | NES | un Е 
"Огт A | 10 AT .08 6.5 | 1.6 | 2 
Form M | io | 49 08 63 | 17 4.6 24 
E l mm | G L — 
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TABLE 2 


RELATIONSHIP OF SCHIZOPHRENICS’ PERFORMANCE DEFICIT TO TEST Dire 
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| Accuracy 
| | | Vari B EE Е MES 
У er of Um. | ойша Schizophrenics 
Degree of diede | шшер | | ivem dif- = eem — Wesen _ 
| | | x SD x SD 
| | 
Medium | | | | i 
Emotional | 10 | .68 904 6.4 | 13 Ав d 
Neutral | 10 .68 04 6.6 2.2 3.8 
m | ; | : 1.7 
Emotional | 10 73 -003 zi 1.4 2 
Neutral 10 kil -003 7 1.3 s 


each group. The schizophrenics, as expected, 
did not differ from the normal subjects on the 
very difficult subtests (¢ = .32 for emotional 
and .72 for neutral), but showed a larger 
deficit on the easier subtests (t= 4.76 for 
emotional and 5.84 for neutral; р < .001 for 
both). The investigator who compared groups 
on one emotional and one neutral subtest 
might conclude that he had found a greater 
Schizophrenic deficit in emotional than in 
neutral vocabulary, or vice versa, if he chose 
subtests of differing levels of difficulty. 


ALTERNATIVE Ways ro RULE OUT 
DIFFERENTIAL DISCRIMINATING 
POWER AS AN ARTIFACT 


In some situations, tasks unmatched on dis- 
criminating power may give legitimate evi- 
dence of differential deficit in ability. If the 
control task is a more discriminating measure 
of nonpathological differences in ability than 
the experimental task, and if, despite this 
disparity, the experimental task yields the 
Steater difference between pathological and 
normal performance, one must attribute the 
differential performance to a true differential 
deficit rather than to a generalized deficit 
coupled with a difference between the tasks 
on discriminating power. For example, the 
00 reliable subtests of standard intelligence 
€sts are usu iscriminati 
less rely dur Чайан BN 

) xperimental tasks 


that are used in ; 
most stud i 
defit. udies of pathological 


A less 
used j 


| Adequate approach that is sometimes 


О select atypically poor-scoring con- 


trol subjects so as to match pathological um 
normal groups on accuracy on the eu. 
task. This method is often complicated by e 
problems of statistical regression. One wt 
select atypically good-scoring pathologie 
subjects and atypically poor-scoring "D 
subjects on the control task to ааа. 3 
matching, and these atypically scoring ‘ihe 
groups usually regress toward the means Ч rie 
larger groups on their scores on the par 
mental task. Matching subgroups may n 
produce systematic unmatching on other e Ў 
ables, as recently discussed by Meehl (19 
The usefulness of findings from a study US" = 
such matching ranges from fair to aaa > 
pending on how extremely atypical the 
jects must be to achieve the matching. 


STATISTICAL SOLUTIONS е. 
npted 5 + 
of dem? 
р differ- 
ions 177 
t 


Several investigators have atter 
tistical solutions to the problem 
strating differential deficit with tests O 
ing reliabilities. These statistical solut d 
clude (a) analysis of covariance tO scores 
out the scores on the control task Пт analy” 
on the experimental task, or regressio” ^ (sk 
sis to estimate scores on the experime? 
from score on the control task, an sl 
sign a residualized error score to X. sco 
and (5) estimation of їрєгєн е me 
of reliability. Neither of these me Sud 
quate. In each case the more en a 
test will still yield a greater, р 
deficit than the less discriminant we 
given to a group of subjects with 
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than the control group. A statistical solution Lorp, F. M. A theory of test scores. Psychometric 
to the problem may be possible, but we are _ Monographs, 1952, No. 7. (b) 


dab Meerut, P. E. High school yearbooks: А reply to 
not aware that one has yet been developed. Schwarz. Journal of Abnormal Psychology, 1971, 


77, 143-148. 
REFERENCES Truscott, І. P. Contextual constraint and schizo- 
оар: B уе " TE phrenic language. Journal of Consulting and Clini- 
RD, F. M. The relation of the reliability of mul- cal Psychology, 1970, 35, 189-194. 
liple-choice tests to the distribution of item dif- И 
ficulties. Psychometrika, 1952, 17, 181-194. (a) (Received May 22, 1972) 
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IS THERE ANY PROPRIOCEPTIVE FEEDBACK? 
COMMENTS ON SCHMIDT (1971) 


BILL JONES 1 


University of Queensland 


Schmidt argued that experiments by himself and his co-workers support the 
theory that the timing of skilled movements is based upon proprioceptive 
feedback. The author argues that Schmidt’s account oí timing is too narrow 
to describe skilled performance and tries to show that the experiments Schmidt 
relies upon do not rule out an alternative theory of timing in terms of central 
monitoring of efferent signals to the muscles. 


Any skilled task demands that the central 
nervous system have the ability to control the 
temporal ordering of a pattern of movements, 
Even a "discrete" voluntary movement re- 
quires at the least the coordinated regulation 
of two antagonistic muscle groups at a joint. 
This central capacity to order the temporal 
sequence of movements is ordinarily referred 
to as "timing" (Bartlet, 1947; Lashley, 
1951). Schmidt (1971) has argued that tim- 
ing depends upon feedback from propriocep- 
tors in joints, tendons, and muscles. Schmidt, 
in practice, equates timing with timekeeping 
or “time estimation,” and most of the experi- 
mental paradigms he reviews are concerned 
with such discrete acts as pressing a key or 
Switch. However, his paper implies that he 
wishes to place his proprioceptive feedback 
theory in the wider context of skill (i.e., any 
sequence of movements which meets some 
criterion of correctness). For example, he 
argues (p. 383) that the high levels of skill 
displayed by well-practiced subjects depend 
upon the timing (ie., temporal ordering) of 
the various submovements, and in contrasting 
his feedback or input theory with a prepro- 
gramming hypothesis, Schmidt says that the 
Preprogram “dictates the timing of the vari- 
ous submovements in a sequence [p. 392].” 

In the experiments discussed by Schmidt, 
the subjects task was normally to respond to 


questions such as, At which time shall I press 
the key, which required him to estimate a pe- 
riod of time given, it is assumed, proprio- 
س‎ 


1 
ا‎ for reprints should be sent to Bill Jones, 
ан of Psychology, University of Queensland 

ia, Queensland, Australia 4067. i 


ceptive cueing. In contrast, consider any 
transcription skill such as typewriting ле 
the subject's task is not to estimate the pas 
sage of time in order to control the sequence 
of movements. Since he has not merely 
to press any key but to press the correct - 
in the sequence, the subject needs to kno / 
the extent and direction of previous —" 
ments in the pattern. In short, the skill s: 
quires accurate placement of the fingers, # 3 
information as to position of the fingers ee 
prerequisite of timing. It is, of course, a e: 
batable point, whether such information er 
from proprioceptive feedback or through ш 
monitoring by the central nervous syste™ 
its own motor outflow (i.e., preprogramming 
and in a brief note, it is hardly possible p 
view the evidence for either position. It p 
simply be noted here that there is strane cu 
dence which would be difficult to o 
with any existing proprioceptive input 1971; 
(see e.g., Angel, Garland, & Fischler, 
Higgins & Angel, 1970; Jones, 1972). T of 
If we adopt Schmidt’s restricted 4528 
timing as time estimation, it does not и 50 
me that the experiments he discusses Т osed 
the central monitoring hypothesis as s sk 
to proprioceptive feedback model. vp short 
which required subjects to esme t 
time interval by moving a slide n 1969) 
duration over a given distance, E estimatio” 
found that absolute error of time 7 te 
decreased when acceleration, vel to m 
sion cues were manipulated pad т б. 
resistance controls, being most sorte? P 
manipulations of acceleration. three A 
jects were required to smell two: Joe 
four-letter words with auditory 


eem 


cues 
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by white noise, and time estimation was more 
accurate with four-letter words. When spelling 
was withdrawn, there was a marked decline in 
accuracy of time estimation for the no-re- 
sistance controls with only slight decreases for 
the resistance groups. 

_ Schmidt (1971) argued that spelling pro- 
vided proprioceptive feedback from the lar- 
yny, lips, and tongue. However, it may equally 
well have provided a self-pacing strategy via 
motor outflow to the larynx, etc., since, of 
Course, if spelling is discontinued, not only is 
there an end to feedback signals from the 
larynx but there is also an end to motor out- 
flow to the larynx. It is perhaps worth noticing 
that proprioceptive signals from the larynx 
do not provide the singer with sufficient in- 
formation to control pitch since accuracy of 
Pitching declines when auditory feedback is 
Withdrawn (Deutsch & Clarkson, 1959). It 
May be argued that Deutsch and Clarkson are 
dealing with continuous graded activity 
esas Schmidt is arguing about a discrete 

Sponse, However, the so-called discrete re- 
Sponse stil] involves temporal patterning of 
те contractions and may therefore be 

"pared with “continuous” responses. 
ines results in the study by Schmidt and 
ШП: (1969) did, in general, indicate 
lens th a higher levels of movement (Schmidt 
feed] the question by saying proprioceptive 

back) provided through rotation of a 
s the subject's ability to anticipate the 

Пе of arrival of a signal was facilitated. The 
Subject was required to rotate a lever with the 
eft hand over a time course of 1.5 seconds, 
he time taken by a moving pointer, which 
Was started by the subject's action on the 
“Ver, to reach a stationary target. The sub- 
Ject’s main task was to release a reaction-time 

€y with the right hand when pointer and 
arget became aligned. 

e were three feedback conditions: in- 
Mirum and high, in which the subject 
NI €d the lever through 3.5 inches and 11.5 
Bb respectively, and minimal, where the 

Ject merely depressed a switch with the 

hand to activate the moving pointer and 

n rested his arm on his left knee. In the 

ain, lower reaction times were obtained with 

© higher levels of movement. Propriocep- 
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tive-feedback manipulations may, therefore, 
be important in increasing accuracy of ve- 
locity control and so provide a cue for time 
estimation (but that is not necessary evidence 
that proprioceptors are involved in the con- 
trol of limb placement within temporal pat- 
terns). 

On the other hand, since subjects made 
voluntary movements there must have been 
signals from the central nervous system to the 
muscles which could have been centrally mon- 
itored. Schmidt and Christina’s results may 
only indicate that such monitoring is facili- 
tated by increasing the amount of movement 
involved. The central theory could only be 
ruled out if Schmidt and Christina’s results 
were replicable with passive rather than ac- 
tive carriage of the left arm. Since subjects 
were, in fact, free to move as far as they 
pleased, the end point of each movement 
could have been preprogrammed, that is, de- 
termined by patterns of outflow within the 
central nervous system rather than propriocep- 
tive inflow and the right-hand response cued 
on а central rather than peripheral basis (see 
Jones, 1972). 

To summarize, Schmidt (1971) implies that 
studies of timekeeping throw light on the co- 
ordination of skilled movements and, more 
specifically, that proprioceptive feedback is 
involved in timekeeping. However, it is clear 
that the coordination of a temporal sequence 
of movements in a skill requires much more 
than time estimation; it requires knowledge 
of direction and extent of previous movements 
in the sequence since movements must be ac- 
curate, Second, the studies which Schmidt 
(1971) regards as central to the propriocep- 
tive feedback theory of timekeeping (Chris- 
tina, 1970; Ellis, 1969; Schmidt & Christina, 
1969) do not, in fact, provide evidence that 
proprioception is necessarily involved in time 
estimation since if subjects make voluntary 
movements, there must be central nervous 
system outflow which could serve as an alter- 


native basis for time estimation. 
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PROPRIOCEPTION VERSUS MOTOR OUTFLOW IN TIMING 


timing, the proprioceptive feedback exp 


Rather than the proprioceptive feedback in- 
terpretation for some of the data I reviewed 
in my paper (Schmidt, 1971), Jones (1973) 
Prefers an interpretation wherein motor out- 
How to the muscles provides a basis for the 
timing of future responses. Jones’ point is that 
the data discussed in terms of the propriocep- 
tive feedback input hypothesis did not rule 
Out the possibility of motor outflow, and that, 
therefore, the motor outflow view was also de- 
Serving of attention as a possible explanation 
for that body of data. Strictly speaking, he is 
Correct, о 7 

While space does not permit an extensive 
ш of the motor outflow and proprio- 
Sram feedback hypotheses for movement 
iubes , it should be pointed out that, using 
o ines responses, there seem to be two lines 
is Vidence for the outflow hypothesis which 
th anot be handled by other theories. First is 

* evidence by Angel, Garland, and Fischler 
Horn) who showed that subjects could cor- 
e errors in step tracking without peripheral 
Cedback. Second, the evidence by Taub and 
ES man (1968) that monkeys could learn 
dic Or responses after deafferentation may 1n- 
in RT that some form of feedback was mediat- 
Spee These studies point to the pos- 
En of internal monitoring of the motor 
Es to the muscles via a rapid feedback 
рар as a mechanism in motor learning and 
de Огтапсе. To the contrary, that a subject 
Deri 8 movement in the absence of e 
i М eral feedback (e.g., Lashley, 1917) an 
EC the accuracy of duplication is a function 
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A REPLY TO JONES 


RICHARD A. SCHMIDT 
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Jones’ point that the motor outflow hypothesis should be considered a rival 
to the proprioceptive feedback interpretation of the motor response timing 
data reviewed by Schmidt has potential merit. However, with no data ruling 
in favor of motor outflow over proprioceptive fec 
other peripheral feedback channels (e.g, 


dback for timing, and with 
vision) known to be mechanisms for 
nation is favored over motor outflow. 


of active versus passive presentation (e.g., 
Jones, 1972) can be handled easily by the 
notion of the motor program (see Keele, 
1968; Schmidt, 1972; Schmidt & Russell, 
1972). The problem with all this evidence for 
the present question is that it deals with the 
spatial aspects of movement production and 
error correction, whereas the present issue con- 
cerns mechanisms for determining the time of 
initiation of these sequences (e.g., the time to 
initiate a swing so as to hit a pitched ball). 
Of course, the possibility of internal monitor- 
ing for one class of behavior does not provide 
assurance that internal monitoring is involved 
in timing. 

In sharp contrast to the suggestion that 
outflow is involved, there is strong evidence 
that peripheral information plays a powerful 
role in timing. Consider vision, for example, 
in a situation where the subject must respond 
exactly 2.0 seconds after a starting signal 
while either watching or not watching a clock. 
Some of our unpublished evidence shows that 
with vision, subjects make absolute errors of 
milliseconds, while without vision 
(e.g., Quesada & Schmidt, 1970) the mean 
absolute error is about 100 milliseconds. 
Clearly inflow from the periphery can aid 
timing. Since there is no logical necessity to 
assume any fundamental difference between 
proprioceptive feedback and the other senses 
(other than, of course, those differences in- 
volving the specific receptors or the types of 
physical stimuli detected), it would seem that 
Jones’ (1973, p. 5) suggestion that the pro- 
prioceptive feedback hypothesis could be ruled 
out if passive movements had been employed 
in the Schmidt-Christina (1969) experiment 


only a few 
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ld be certain to support the proprioceptive 
feedback hypothesis. What is the difference 
betwe having a subject watch a clock or 
having his limb moved predictably for 2.0 
seconds? Both provide reliable sensory infor- 
mation about the passage of time, and pro- 
prioceptive feedback would almost certainly 
be superior to no feedback at all. In fact, we 
(Schmidt & Christina, 1969) did not use pas- 
sive movements because we wanted to rule out 
this explanation, as we were interested ло! in 
w. the subject could use proprioceptive 
Í to time (which seemed obvious) but 
whether, in fact, he did use the feedback, Our 
findings indicated that he did, although we 
must admit the possibility that they could 
have been using motor outflow information. 

"Therefore, since there is the possibility that 
a motor outflow mechanism may be involved 
in timing as it may be with motor control 
situations, and since the timing data do not 
strictly rule out this view, Jones’ point that it 
should be considered a rival to the propriocep- 
tive feedback hypothesis is well taken. How- 
ever, because there is support that peripheral 
information is strongly involved in timing, and 
there is no direct support that a motor out- 
flow mechanism is involved, I naturally prefer 
to interpret the timing data in proprioceptive 
feedback terms. The answer to Jones’s ques- 
tion, “Ts there any proprioceptive feedback?” 
seems to be “Yes” for timing. We now anx- 
iously await solid data which answer affirma- 


tively the question, “Is there any outflow in 
timing?” 


RICHARD A. SCHMIDT 
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COMPU" ATIONAL PROCEDURES FOR ESTIMATING MAGNITUDE 
OF EFFECT FOR SOME ANALYSIS OF V, \RIANCE DESIGNS 


DAVID H. DODD! Ах» ROGER F. SCHUL 


University of Ulah 


Computational procedures are presented for determining estimates of magnitude 
of effect or proportion of variance (w?) for a variety of analysis of variance designs. 
"Tables summarize simplified computational formulas for fixed, random, and. mixed 
designs, including both nonrepeated- and repeated-measures cases. 


Vaughan and | Corballis (1969) have dis- surement, overlining is employed, for example, 
cussed the problem of estimating magnitude of A. Since subjects (a random factor) always 
effect in various analysis of variance (ANOVA) becomes a factor in repeated- -measures designs, 
designs. Their paper emphasized the import- the letter S is included in the symbol for such 
ance of reporting measures of degree of associa- designs, for example, ABS isa design having 
tion between the independent variable(s) and Fixed Factors A and B and repeated ı measure- 
the dependent variable. Most significantly, it ment on A. 

Makes available a general technique for finding 


Such estimates, a technique based directly on TNR а И А 
€xpected mean squares. The technique can be ER lacions Нах NE NO 

üpplied to virtually any ANOVA model, REPEATED MEASURES 

although certain designs do not, in general, The computational formula for these designs 
allow unbiased estimates, is given by Hays (1963). Regardless of the 


peace Local a oe fe number of factors, estimates are obtained by 
some frequently aan Eine All of essentially the same formula, | | 
the formulas ie denved direste T ihe In the one-way ANOVA, the proportion of 
expected me: UN uae ^ ww. Eno cel "ariance accounted for by the treatments is 
Bub pie squares, E(M.S)s, for each 
of variance. Some algebra yields 
malas Which are generally simpler than 
€ presented by Vaughan and Corballis. In ша = 
add ition, they eliminate the need to determine 


| Se 
| ера arately an estimate of variance for every 


of interest : 


Source. This computation can be avoided where The result is essentially the same for the 


* variance source is not of interest to the two-way ANOVA. This case is used to illustrate 
investigator or is small enough, relatively е entire derivation; consider Factor A. The 
variance estimate 


ime to be ignored. MÀ ees pee {т numerator consists ee the 
Ing ris uh ca ae ee te acre for A given in Table 1. The denominator con- 
common нента т pea many of sists of the sum of all variance estimates. 
Which are not presented in Vaughan and Multiplying numerator and denominator by 
Сота, с пЈК and simplifying yields the results indi- 

The notation used here is basically that of cated. If the F fora factor is relatively low, or 
Vaughan and Corballis. Each factor in a the factor is not of interest, that calculation 

can be omitted. 


Че 
ee is symbolized by an Arabic letter. Fixed а T . 
q padom factors are indicated by capital In general, for any size ANOVA involving 


"id l lowercase letters, respectively. To indicate 
i ч s 
uch п factor (s) is involved in repeated mea- 


fixed factors where none are repeated, the 
computational formula is 


Dea Requests for reprints should be sent to David H. T SSettoct = (ене X Мр) 
| Sale Department of Psychology, Univ ersity of Utah, @oifeet = Жш] E Ms. 5 


Lake City, Utah 84112. 
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TABLE 1 


а = " — "Р тр СРЯ NITUDE 
EXPECTED MEAN SQUARES, ESTIMATES or VARIANCE, AND NUMERATORS FOR ESTIMATING MAGNITUE 


or EFFECT FOR THE Сомрох 


Ts OF A Two-Way Fixep Factor Dr 


d erator f 
oue EOIS) Eae of aiai coda ргосейиё 
| pe 
| nJK (J=1)(MSa—M. T rox Md 
4 2 Ait Sg E л Xt MN SSa = (dfa X Мог 
А с? + nko. o + 35 174 wk | 1 
ү nJK (K—1) (MSy — M Sorco) a CN 
B e d nor =o? + к<” DES m SS = (dfn X MSerror 
nJK „ |U-D(C-DOLSan— M Sess) | oe Sun) 
AB с? + nbat = o + Gano - UK SSan = (dfan X Монт 
Error oè M Setor 
А mnm 


FIXED MODEL WITH REPEATED MEASURES 


For designs in which there is repeated 
measurement on a fixed factor, there is some 
degree of underestimation in estimates of 


FIXED Factor REPEATED-MEASCRE DESIGN NONADDITIVE AND ADDITIVE MODELS 


Note, The denominator for all estimates of magnitude of effect is Хаа + MV Sone 


proportion of variance since the expected value 
of the denominator is somewhat greater than 
the sum of the true variances, In many casen 
the investigator may simply choose to ps 
this bias, which may be relatively small. The 


TABLE 2 


سے 


Degree of bias 


Design Conditions Denominator in denominator 
ERI | 
Nonadditive None 55а + MSs + AMS ans + 92/7 
_ Additive No 4 X S effect | 55 + MSs None 
ABS | P 
Nonadditive | None | a + MSs + nJMS pzs ToU 
.. Additive No B X S effect | мы + MSs None 
ABs аА 
Nonadditive | None | Sunt + MSs + UM Sas + nMSuxs | ше Le 4 nm 
| +n +K – 1) Мавы Ik” F 
Additive (a) No AB X S effect ма + MSs + nMSaxs None 
+ пух — 2nM Sans — 
(0) Nod X S, BX S,or | SSuui + MSs None 
" AB X S effects 
ABCS dn 
Nonadditive None SStotat + WSs + ЛЛК MSexs T oe + 
_ Additive No C X S effect | SSuxi + MSs None 
ABCS | 2 о? 
Nonadditive | None 58а + MSs + АЛМ +nIMSexs| e у T а 
ОК + L 3)M Spess AL S 
Additive (a) No BC X S effect 55а + MSs + АЛ Ses EH 
| canJMSexs — WIMSpexs Өр, 
(0) NoB XS, CXS, or | 55а + MSs None 
BC X S effects» : im _— 
e lee N 
а Pooled MSc = 25428  SSmes + SSanzs Г m a | шог forall F ratios and appe? 
ШР cem баха + dines Fares * The pooled MSc; is used as the denomina cÀ 


© Pooled MSerror = 


which uses MSs) and tl 
Note. Numerators are 4 


or exam 
xs + 5, 
dfuxs + dj 

h а 


LEL 


r 
feet — Gana XMS. 


appropriite error 


x — edu d) issue. 
— The pooled 
n the numerator for all ш 


ot 
1 fe 
aoe (exce? 

ox wii # atone 

is used as the denominator for all F rat 


B _ pM Sene 
pt A), for example, Хун — (К = Uf actions. 
) for the effects of A, B, C, and their in 


direction of bias is such that the investigator 
knows that his effect is at least as strong as 
the obtained estimate. However, as suggested 
by Vaughan and Corballis, the additive model 
| can be assumed, that is, one can assume that 
| there are no inter ‘tions of subjects with 
| 

| 


Tepeated-measures factors. A test of this 
assumption was devised by Tukey (1949) and 
Is described by Winer (1962, p. 217), Myers 
J (1966, p. 166), and Kirk (1968, p. 137). Un- 
ornately, this test is both laborious to calculate 
and restricted to the ass 'ssment of possible 
Mear X Linear interaction components. А 
Second solution is sometimes possible. If, for 
Cach subject, replications on at least one of the 
Tepeated-measures factors сап be obtained as 
Part of the experiment, then an independent 

| timate of error Variance can be made, allow- 
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ing unbiased estimates of magnitude of effect. 
This is not possible in any experimental set- 
tings where secular trends might be expected, 
that is, where changes in the dependent 
variable are associated with the time or trial 
of testing. For example, secular trends are 
expected when learning trials are the repeated- 
measures factor. In some kinds of research 
(e.g., psychophysics), it is both possible and 
even efficient to obtain more measures from 
cach trained subject. The variance between 
the replicates (measures on the same subject 
under a given treatment combination) estimates 
error variance. With replications, the repeated- 
measures, fixed-effect design becomes identical 
to the mixed model; the subjects? variable is a 
random-effects. variable and replications are 
the within-treatments error component. 


TABLE 3 


 CONPUTARONAL COMPONENTS FOR ESTIMATES OF MAGNITUDE OF EFFECT FOR RANDOM AND MIXED Dr: 
Design | n НИ Denominator 
ЕЕ BN oca S 
Random 
і ке * 
| i E J(MSa = MSs) SStout + MS a 
аф al J(MSA — Ман) З 
B K(MSp — MS an) i Хамы + MSa + MSg — MSan 
AB JK(MSan = MSs) 
the A IMSA = MSas — MSac + MSanc) 
B КОМ — MSas — Ме + MS anc) : | . 
e L(MSe — MSac — MSne + MSanc) owt + MSa + М5в + MSc 
AB JK (MSan = MS auc) = MSan — MSac — MSne 
AC JIL(MSac — MSanc) + Мане 
BC KLM Spe = MSanc) 
IBC IKL(MSane = MSs) 
ШҮҮ 
ah 1 УМ = MSs) Ч | 
B Ўр = (dfn X MS in) | SStoat + MSa + JMSag = JMSs 
" AB JK(MSag = MSs) 
авс 
С al FAESA — MSs) 
B SSy — (dfe X MSan) ? è 
С 550 С a X MS) | SS + MSa + JM San + JMSac 
AB IRU Sza — MS ЖУК + L— DM Sane 
AC IL(MSac — MSs) = ЛКФ + DUSK 
BC SSne — (f| ne X MS anc) 
ABC é asc — MSs) 2 
abc 4 JKLUMS anc ISs) J 
í (11S4 = MSan) | : A ү 
а ш | SS Ma + MS diu. 
C SSe — [dfe X (MSac + MSue — MSane]| + IMSac + JM Sac 
АВ — IK(MSan — MSs) Bd dies 
4C — SJL(MSac — MSanc) =f EMSS 


BC KL(MSnpe = MSanc) 
Е. ABG — JELUSaue MSY 


| 
j 
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TABLE 4 


COMPUTATIONAL COMPONENTS FOR RANDOM AND MIXED DESIGNS WITH REPEATED MEASURES m. k 
Design | one | Numerator Denominator 
Random 
aS Solution identical to the ab design. See text. 
abs Solution identical to the abe design. See text. 
abs 1 J(MS4 — MSs — MSan + MSx d М " 
B K(MSg = MSan) | SSow + MSa + MSp— MSA q 
AB ЈК(М5ав = MSuxs) | 
A 5 nJ (MSs — MSpxs) J 
it єй 
у A 55а — [(dfa) X (MSs + MSas = MSpxs)] „аг 
B К(М5в — MSnxs) SStom + MSs + MSp + KMSaB 
AB JK(MSan — MSpys) i = (K + 1)MSuxs 
8 nJ (MSs — MSpxs) 
aBS 1 J(MSa — MSs) (1) Nonadditiv ne 
B 55в — (dfe X MS an) 55а + MSA + IMS a8 
AB JK(MSan = MSnxs) + Ji = 0) Мх fect): 
5 nJ(MSs — Мх) (2) Additive case (no B X S effect): 
55м + MSA + Лав 
= IMSnxs 
aBS A J(MSa — MS дух) (1) Nonadditive case": na 
B 58и — 4/н(М5вхз + MSan — MSanxs) SStot + MSs + MSa — М9 
AB JK(MSan — MS anys) + aM Suxs + JMSan | 
5 n(MSs — MS axs) + (nJ —n—J)M act 
(2) Additive case (no AB X 2)’ ¢ g^ 
58а + MSs + Mga + МЭЛШ 
+ nMSyxs + JMSAB 
= (n+ J)MSanxs on 
a Additive model only: 1 етет аә” 
' Denominator too large by eK. | 
d í 
Table 2 presents the one-, two-, and three- E(MS4) for the random model is: | 
factor designs. The results can be easily ex- " Ke | 
tended to four-factor and larger designs, where те aes uite 
only one or two of the factors are repeated. The Computational formulas are thus not d (ре 
table also provides the necessary information Jike the fixed-ellects results. In generi ots 
2 to the conditions for the additive model, the — numerators for estimates of random-¢ 
degree of bias in the denominator, etc. Note in variables are given by 
particular that for all of these designs the 
numerator is given by (Number of levels of factor) ‘ m 
2 p . pr te 
= X (OMSoia — M Sapirovriate err" " 
ЗУ — UT XM Savproprinte error term); В s rocedure m 
5i The entire computational pro rized ү. | 
E. „the appropriate error term is the random and mixed designs is a went! 
ominator of the F ratio for that effect. Table 3. These results can be react? © 
: ized to larger designs. 
RANDOM AND MIXED WITH No ‚үү | 
REPEATED MEASURES RANDOM AND MIXED DESIGNS 
Consider, for ех ; REPEATED MEASURES mnm 
; for example, a design havi Р EPEATED + g COPA 
random-effects p £n having two " more ggf 


variables. In contrast to 


EMS) for fixed effects in Table 1, the 


Solutions for these designs 


plicated. Whenever the repea 


ar 
ted me 


as 
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is made on a fixed factor, the estimates show 
Soiue degree of negative bias. Because of the 
large number of d signs generated by three 
actors, "Table 4 gives solutions for one- and 
two-factor designs only2 The ās and abS de- 
signs are, as noted, identical to the ab and abc 
designs, respectively. For the a design, simply 
consider that S becomes б and л becomes K, 
then refer to Table 3. For the abS design, S 
becomes c and л becomes L. 


CONCLUSIONS 


It is hoped that the availability of tables 
giving direct computational formulas for the 
Most common designs will encourage re- 
Searchers to include estimates of magnitude of 
elects, Researchers are encouraged to read 
E s 


з А 

* Tables covering the three-factor random and mixed 
Tepeated-measures designs can be obtained by wri g 
to the first 


author, 
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Vaughan and Corballis (1969) ior a more 
complete discussion of a variety of issues and 
for the solution of special problems such as 
specific comparisons between treatment levels. 
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DIRECT VISUAL PERCEPTION: 
A REPLY TO GYR 


JAMES J. GIBSON ! 


Cornell University 


In his criticism of this author’s theory of perception, Gyr did not take into 
account the basic issue of the availability of stimulus information and did not 
consider the radically new conception of proprioception and self-awareness that 


is an essential part of the theory. 


aper in the April 1972 Psychological 

by John W. Gyr (1972) entitled “Is 
a theory of direct visual perception ade- 
quate?” does not, it seems to me, get at the 
deep issues. He objects to my theory of per- 
ception as published in The Senses Consid- 
ered as Perceptual Systems (Gibson, 1966) 
and most recently in “Outline of a theory of 
direct visual perception” (Gibson, 1972). His 
grounds for objection are that I leave out of 
account the internal contribution to the per- 
ception of the external environment, or the 
fact that perception involves the construction 
of an internal model of the environment. 
(This kind of objection can take many forms 
and I think I am aware of all of them.) But 
Gyr also disagrees with my theory of proprio- 
ception, since he follows the ideas of von 
Holst as to how the observer gets information 
about his own activity. He seems to believe 
that this information about action or about 
plans for action provides the internal contri- 
bution to perception, and this I cannot under- 
stand. The problem of the adequacy of ex- 
ternal information is one thing, and the prob- 
lem of how to conceive proprioception and 
self-awareness is another, 

First, as to the problem of mediated or 
direct perception, Gyr failed to consider the 
basic issue: whether there is ло information 
in ambient light to specify the features of the 
world, as classical optics has taught, or 
whether there is. Perception can only be di- 
Tect in the latter case, Tf the specifying in- 
AH e EE: 

em and does not have to 


1 Reprint requests sho: 
should be sent to James J. Gib- 
vs Department of Psychology, Uris Hall Cornell 
iversity, Ithaca, New York 14850. ' 


construct them. If they are ло! available, even 
under favorable circumstances, then the fea- 
tures of the world always have to be con- 
structed in the mind in some way that Wê 
have not yet been able to agree upon. The 
process of extracting invariants from ambient 
light is not understood either, but it is so radi- 
cally different from a process of constructing 
an internal model of the world that it would 
put the study of perception on an entirely 
new footing if we accepted the possibility. 
This is a genuine issue. It is preliminary to 
the study of the act of perception. It concerns 
the very nature of the environment to be pet 
ceived. Tt raises deep questions having 10 1 
with what philosophers call ontology an 
epistemology, but psychologists neglect then 
at their peril. - - Ж 
Next, there are all the problems arising о P 
von Holst’s theory of the “efference CODY e 
Many investigators have recently taken dr 
up, including Held and Festinger and pw 
collaborators. They are century-old Vna wd 
problems on what used to be called "ser ni 
tions of innervation” and *ideo-motor es 18 
But these are legacies of mentalism. Ther gs 
nothing really new in the idea of what sit 
calls the organism “getting information ae 
its own voluntary activity.” He md 
many experiments which show in à E t 
way that perception and action pap*t yi 
separated, but that does not help to 1% 


s supp 
the issue. All these experimenters pre of ре“ 


s " . sis , 
classical sense impressions as the a ist gi” 
ception. If sensory signals are barre om 


ng on motor ре 


ji i use addi 
with, there is no rhe senses WE 


mands. Anyone who considers 


n 

^ a me as 

channels of sensation Йа to i is. w 
К : Р 2 
when it comes to perception. store: ci 


to assume a mind that can copy; 
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Dare, match, decide, and issue commands—a 
man in the brain. 

The whole idea of sensory signals and motor 
commands is wrong, together with the psychol- 
Ogy that goes with it. The brain does not re- 
ceive messages nor does it issue orders. Even 
the use of the terms sensory and motor is 
mentalistic, The concept of a perceptual sys- 
lem is in sharp contrast to the old notion of 
a sensory channel inasmuch as the brain takes 
105 proper place as part of the system and is 
70 longer the seat of the mind. A perceptual 
System is supposed to seek an equilibrium and 
Teach an optimum, as a system should. This 
formula avoids both mentalism and mecha- 
nism whereas formulae like that of Gyr are 
үтү combinations of mentalism and mecha- 

ism, 


tems instead of 
meaning on the 


The theory of perceptual sy 


Senses puts an entirely new 
terms proprioception and self-awareness. It 
Implies 4 rejection of the distinction between 
Proprioceptors and exteroceptors together with 

* doctrine of specific qualities of sensory 


n А 
E and the theory of sense data. It is up- 
etti be 

Ung, but it eliminates a vast area of mud- 


d era m 
* апа confusion in psychology. It assumes 
at there is always available to every per- 
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ceptual system both propriospecific informa- 
tion and exterospecific information at the same 
time. Thus one cannot be aware of the world 
without being potentially aware of the body, 
and vice versa. Each pole of experience im- 
plies the other. If this is true, all the experi- 
ments and observations on proprioception and 
kinesthesis for a century will have to be rein- 
terpreted, for the terms will not mean what 
they they used to mean. A whole set of current 
experiments and controversies will go by the 
board if the modalities of sense are recognized 
as being unimportant for the activity of per- 
ception, for the theory of direct perception im- 
plies an equally direct awareness of the body 
of the observer and the adjustments of its per- 
ceptual organs. New experiments will have to 
be designed to test this theory. 
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LATENT INHIBITION“ 


ROBERT E. LUBOW 2 


Tel-Aviv 


University, Ramat-Aviv, 


Israel 


A review of the latent inhibition literature, concerning the decremental ef- 
fects of nonreinforced preexposure to the to-be-conditional stimulus on sub- 


sequent learning, is presented. 


sheep, rat, rabbit, 


Latent inhibition is found to be a broadly 
based phenomenon appearing across a variety of species: 
dog, human child, and, 


goldfish, 
under special conditions, 


goat, 
the 


human adult, and across a variety of tasks: classical conditioning, avoidance 
conditioning, Ivanov-Smolensky conditioning, conditioned emotional response, 
go/no-go discrimination, reaction time, and conditioned taste aversion para- 


digms. 


and the effects of number of nonreinforced preexposures are examined. 


The stability of latent inhibition as well as its stimulus specificity 


Cur- 


rent explanations of latent inhibition which include the habituation of the 
orienting response, selective filtering, specific antagonistic and complemen- 


tary responses, and conditioned inhibition are discussed. 


The need for a 


combined learning and attention theory is suggested. 


Latent inhibition has been defined as a 
decrement in learning performance which 
results from the nonreinforced preexposure 
of the to-be-conditioned stimulus (Lubow & 
Moore, 1959). Simply, the conditioned 
stimulus presented by itself, before being 
paired with the unconditioned stimulus, re- 
tards the subsequent development of a condi- 
tioned response when the conditioned stimu- 
lus and unconditioned stimulus are paired. 
The purpose of this paper is to describe the 
empirical base which substantiates the ex- 
istence of latest inhibition, in particular, its 
generality across different species and learn- 
ing tasks, and to identify parameters which 
affect latent inhibition. 


SCOPE or Latent INITIBITION 
PHENOMENON 


Distribution across Species 


Latent inhibition has been demonstrated 
in goldfish (Braud, 1971), goats and sheep 


bits (Lubow, Markman, & Allen, 1968 ; 
Reis & Wagner, 1972; Siegel, 19692. 
1969b, 1970, 1971; Siegel & Domjan, 1971. 
Experiment 2), dogs (Konorski & Saw 
kowska, 1952) and, of course, rats (Ackil & 

Mellgren, 1968) ; Ackil, Mellgren, Hi deren 
& Frommer, 1969; Anderson, Merrill, De 4 
ter, & Alleman, 1968; Anderson, О ‘Farrel 
Formica, & Caponigra, 1909a ; Anders 
Wolf, & Sullivan, 1969b; Carlton & Voge 


SR. چ‎ & 
1967; Chacto & Lubow, 1967; Domj: on 
Siegel, 1971; Halgren*; Leaf, Kayser, / Hn 
drews, Adkins, & Leak, 1968; Lubow 


Siebert, 1969; May, Tolman, & PL 
1967; Mellgren & Ost, 1972; Resco " 
1971; Schnur, 1971; Siegel & pania 
1971, Experiment 1), and with some dith їп 
culties, and a number of failures, even : 
humans. The successful demonstrations 

latent inhibition in humans are studies 


аго 
Schnur and Ksir, 1969; Sokolov and P g; 


Én 190 
1 
anova, 1956; Wolff and Maltzman, n 
Hore, 1803 120 Lubow Moore, 1959), fab» is d by Cantor and his associates using di 
and by Cantor : D SUE r% 
due am preparation of this paper was accom- č dren (Bogartz & Witte, 1966; omen 
ned while the author had a vis ing Appoint- z E & Fen? 
шш in Psychology at Bar-Ilan University, Israel, Cantor, 1964, 1965, 1900; Gantor sin 7! 
e o work on the paper was, in part, supported 1968; Miller * 3 Witte & Cantor, wwe 
y National Institute of Health Career Develop- Ss "y Assoc cia ert? 
arem K3 MH-7189, and much of the re- ? Halgren, С. R Latent inhibition: al dis? 
m куз and my students by National ог nonassociative? Unpublished doetor t 
T e Mental Health’ Grant 5 R01 MH- 12307. tion, Indiana University, 1971. f diff p 
or oir S. E. Newman and P. Schnur 4 Miller, F. D. The effects um discri go 
0 
equate E be ign amounts of stimulus familiarization chil dre; Jow” 
ubow, о 1 5 should be sent to К. E. tive reaction time performance n euni ty 
n е 
versity, Ramat-Aviy о Н Ноу, Tel-Aviv Uni- Published doctoral dissertation, Uni 
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TABLE 1 


DISTRIBUTION OF POSITIVE AND N INHIBITION STUDIES 


Species 
Task E — i "n ` p 
uL mr 
F- | 
Classical conditioning | 
Leg flexion | | E 
l'inna or tail response | " | ee | 
Eyelid | | Pa | Ec 
Eie | | | pins 
Salivary | | | | + 
r-bar press | | re 5 | | 
‚ Suppression | ++++ 
ate | path 
р ssion | def | 
Fear-running speed | | E | | 
‚ Suppression | | | 
“ear-avoidance-escape + ttt | Ф 
2ustatory rsion | Bee | 
©, | И | | Bed | | 
»alvanie skin response | n 
vanov—Smolensky | | | re 
Operant-go no-go | ы КӨ 
, discrimination ии 
Ne6sponse speed + + ttt - 
| | ЕЕЕ 


Note, pag. The same number next to more than one entry signifies independent experiments reported in the same article, 
12: Different dependent measures from the same subjects. 


Wittes), With the exception of Schnur and unsuccessful demonstrations of latent 
and Ks (1969), who used an eyelid-con- inhibition across species and tasks. 


ditioni rocedure, а > other human 4 T€ 
nmg procedure, all the « Distribution across Testing Situations 


Studies depart from traditional conditioning i 
Methodology. The failures to find latent Classical conditioning—Pavlovian. La- 
"hibition in studies using human adults tent inhibition has been demonstrated for the 
Sübjects inelude Grant, Hake, Riopelle, and leg flexion response in goats and sheep 
Sostlan (1951), Grant, Hake, and Schneider (Lubow, 1965; Lubow & Moore, 1959) ; tail 


(1948) Perlmuter (1966), Allen," and movement in the rat ( Chacto | m 
Schnur, all of which did use the eyelid-con- рн en a we sa 
Шой де naradi. 2 1 Meyer and Joseph bow, Markman, & Ё ); ә 
g paradigm, and Meyer a › Н : à wh 
(196g) who en a reaction time procedure. мешии d cu gr ber 
Ta й ен ий f success ner, 1972; Siegel, a. 1 Я Я 
‘ble 1 shows the distribution of successful К баш ТЛ Mice 
inhibition within this classical conditioning 


° Witte K L. Chil ren's instrumental response 
K.L dren's instrur 1 › : 
did CS Она чы! cordoned simi Med procedure is reported by Suboski, DiLollo, 
in iliarizati «perience. npublishe a de V m 
Masters ue se ge Iowa, 1965. and Gormezano (1964) using the nictitating 
du, ien, J. A. Latent inhibition: Generalization membrane response in rabbits. With adult 
" sis [es ee poe e human subjects using eyelid qo o. 
s Хог Carolina State University. ` incedo 69. usi do И 
7 Schnur, P. Latent inhibition: The effects of negative findings are er ы hs 
| чуер forced preexposure of the CS in differential et al. (1948), Grant et al. (1951), Perlmuter 
id Conditioning. Unpublished master's thesis, (1966), Allen (see Footnote б], and Schnur 


arolina State University, 1967. 


Ц 
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(see Footnote 7). Only Schnur and Ksir 
(1969) have reported latent inhibition. in 
humans with eyelid conditioning. While 
with infrahumans, 12 of 13 classical condi- 
tioning experiments have yielded a latent 
inhibition effect, with adults, 6 of 7 studies 
have failed to find a latent inhibition effect. 
Classical conditioning—conditioned fear as 
a suppressor. Latent inhibition has been 
demonstrated in a variety of to-be-suppressed 
responses: licking rate (Carlton & Vogel, 
1967 ; James, 1971; Leaf et al., 1968; Lubow 
& Siebert, 1969; Schnur, 1971), bar-press 
rate (Anderson et al, 1969a; Anderson 
et al., 1969b; Domjan & Siegel, 1971; May 
et al., 1967 ; Rescorla, 1971; Siegel & Dom- 
jan, 1971), and running speed (Anderson 
et al, 1969b). The conditioned suppres- 
sion technique, as applied to latent inhibi- 
tion, has been employed in 19 independent 
experiments reported in 13 articles. Of 
these only the last cited failed to demon- 
strate the latent inhibition effect. 
Avoidance—escape conditioning. — Vive 
studies have been performed with this task. 
Four of the studies demonstrated the latent 
inhibition effect (Ackil & Mellgren, 1968: 
Ackil et al, 1969; Braud, 1971: Grant & 
Young, 1971) and one study failed to dem- 
onstrate it (McDaniel & White, 1966). 
Discrimination of go/no-go stimuli. This 
group of studies extends the latent inhibi- 
tion phenomenon to an operant situation 
where the unconditioned stimulus is appeti- 
tive. With the exception of one study which 
employed salivary conditioning with dogs 
(Konorski & Szwejkowska, 1952), all of 
the animal studies discussed previously have 
used classical conditioning with a noxious 
unconditioned stimulus. The procedure used 
in the three experiments by Mellgren and Ost 
(1972) differs from the typical latent inhibi- 
tion study. During some nonreinforced pre- 
exposure periods reinforcement is present 
but not correlated with any stimulus nor con- 
tingent upon any response, n addition 
both the SP and sa ате successively pre- 
EV analyses of responding to 
bec he il that the poorer dis- 
Bs n d папсе by the preexposed 
1€ nonpreexposed rats was 
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primarily a function of increased respouding 
to Sè by the stimulus preexposure groups. 
This general trend was apparent in all three 
experiments. Similar findings are reported 
by Halgren* using rats to lever press for a 
sucrose solution and by Seybert? using а 
runway brightness discrimination task with 
running speed as the dependent variable. 

Gustatory aversion conditioning. A series 
of studies concerned with the aversion of @ 
relatively novel food as compared to a 
familiar food following induced illnes also 
may reflect the effects of latent inhibition. 
A number of studies find that aversion to 
flavors after induced toxicosis is reduced by 
familiarization or preexposure to that flavor 
( Domjan,'? Farley, McLaurin, Scarborough. 
& Rawlings, 1964; Garcia & Koelling. 1967 ; 
Maier, Zahorik, & Albin, 1971; McLaurin: 
Farley, & Scarborough, 1963; Nachman, 
1970: Revusky & Bedarf, 1967; Revusky & 
Garcia, 1970; Wittlin & Brookshire, 1968). 

Ivanov-Smolensky conditioning . Condi- 
tioning by the method of Tvanov-Smolensky 


(1933) is an adaptation of the classical re 


lovian procedures to humans. In this tec? 
nique the conditioned stimulus is paired with 
a verbal command which serves as the U 
conditioned stimulus. И 

There have been. only two studies whic? 
used this technique in conjunction with ш. 
reinforced preexposure (Siebert ™ ; Sokol 
& Paramanova, 1956). The latter ше 
using adolescents and an electromyogram «c 
sponse, as reported by Sokolov (1963), vs à 
cates that nonreinforced preexposure eet j- 
in a severe handicap to subsequent es 
tioning. The study by Siebert, employ m 
college age subjects and anticipatory this 


ҮС 
Associat! 


з Halgren, C. R. Latent inhibition: reser” 


or nonassociative? Unpublished doctoral d 
tion, Indiana University, 1971. follow’ 
9? Seybert, J. A. Discrimination learning 

ing preexposure to the discriminative ы ШОЛ 
published master's thesis, University of Е 
9 d 

M The CS preexposure effect by ef 
responses: 
Zastern tor 


anov-5n Not 
M ts thes! 
conditioning. 


Carolina University, 1967. 
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Muscle movements, although only obtaining 
low levels of conditioning for all groups, also 
demonstrated the latent inhibition effect. 

Conditioning of autonomic responses in 
humans. Sokolov (1963, p. 249) briefly 
refers to the nece ssity to reevoke the galvanic 
skin response (as one member of the orient- 
ing response family) after nonreinforced 
trials before conditioning can occur, and that 
it takes more trials to evoke the galvanic 
skin response after nonreinforced preexpo- 
sure than without nonreinforced preexpo- 
Sure. Wolff and Maltzman (1968) used 
three groups of undergraduates receiving 
either 0, 20, or 40 nonreinforced preexpo- 
sures to word stimuli. The O-nonreinforced 
preexposure group reached a peak galvanic 
skin response conditioned response at the 
fourth conditioned stimulus, 20 nonrein- 
forced preexposures at the seventh condi- 
tioned stimulus, and 40 nonreinforced pre- 
exposures at the sixth conditioned stimulus. 
Surwit ® presented groups of subjects with 
either 50 or 100 nonreinforced ргеехро- 
sures to the to-be-conditioned stimulus or to 
an irrelevant stimulus. The 50- and 100- 
nonreinforced preexposure groups exhibited 
fewer galvanic skin response conditioned re- 
Sponses than the groups exposed to the ir- 
relevant stimulus. Amplitude, magnitude, 
and latency of the galvanic skin response 
Were also analyzed but showed no significant 
differences between groups. 

Response speed. Cantor (1969) has re- 
Viewed the stimulus familiarization effect on 
Motor task behavior. His review deals pri- 
marily with children and with reaction time 
Measures. Stimulus familiarization effect, 
“responding less quickly in certain motor 
tasks when the signal to respond is one with 
Which the subject has been familiarized as 
Opposed to a different one not involved in 
any prior familiarization operation [Cantor, 


1969. р. 144]," has been demonstrated in 
l'ive of these 


experiments with children. 
Studies utilized lever pulling (Cantor & 
"пог, 1964; Kubose, 1970; Witte (see 
“Sotnote 5) ; Witte, 1967; Witte & Cantor, 


of Surwit, R. S. The anticipatory modification 
y tie Conditioning of a fear response in humans: 

Sit Published doctoral dissertation, McGill Univer- 
У, 1972. 
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1967), one study used simple reaction time 
(Bogartz & Witte, 1966), and five used a 
choice reaction time situation (Bogartz & 
Witte, 1966, Experiment 2; Cantor & Can- 
tor, 1965, 1966; Cantor & Fenson, 1968; 
Miller, see Footnote 4). In contrast to 
these 11 studies with children, all demon- 
strating the latent inhibition effect, there is 
one study (Meyer & Joseph, 1968) with 
female college students which reports no 
stimulus familiarization effect, 


RS OF LATENT INHIBITION 


DARAMETE 


Number of Preexposures 

The range of 1-nonreinforced preexposures 
across studies goes from 1 and 2 (Domjan 
& Siegel, 1971; Miller, see Footnote 4) to 
280 (Suboski, DiLollo, & Gormezano, 1964), 
300 (McDaniel & White, 1966), 550 
(Siegel, 1970), 1,300 (Siegel, 1969a, 1969b), 
and 4,176 (May et al, 1967). Most of the 
other studies use an -nonreinforced preex- 
posure of between 5 and 80, with the typical 
n-nonreinforced preexposure being 20 or 40. 

There are 12 studies that specifically in- 
vestigated the effects of n-nonreinforced pre- 
exposures. That is, they had at least two 
levels of a-nonreinforced preexposures in 
addition to the O-nonreinforced preexposure 
Table 2 presents the 12 
studies with -nonreinforced ргеехроѕиге 
levels for each group in each study. The 
numbers in the columns refer to the n-non- 
reinforced preexposure for a particular ex- 
perimental group. A number not under- 
lined indicates that group which exhibited a 
significant. decrement in performance rela- 
tive to the O-nonreinforced | preexposure 
group. A number underlined indicates no 
difference between that group and the 0- 
nonreinforced preexposure group. 

The conclusions from inspecting Table 2 
are quite straightforward, yet somewhat 
startling in that the data from which they 
are drawn are almost completely consistent 
between experiments. Of 14 different 
groups from 6 experiments only 1 (Cantor 
& Cantor, 1966) shows an inhibitory effect 
with less than 17 nonreinforced preexpo- 
sures. On the other hand, within the very 
narrow range of 16-20 nonreinforced pre- 


control condition. 


402 ROBERT E. LUBOW 


TABLE 2 


UMBER OF NONREINFORCED 
NT INHIBITION 


Number of nonreinforced preexposures 
Experiment ; کڪ‎ NEE — ڪڪ‎ 
1-5 6-10 | 11-15 | 16-20 | 21-40 | 3140 | 41-80 | s1100 | 101+ 
Ackil & Mellgren (1968) | 5 | | ч | % 
Cantor & Cantor (1966) 5 | | | | 35 | 
Cantor & Fenson (1968) | 5 10 |12 14|16 18 | | | 
Chacto & Lubow (1967) | | 20 | 40 
Domjan & Siegel (1971) 10 | | 25 50 
Grant el al. (1951) 5 10 | 
Lubow (1965) 20 40 
Lubow et al. (1968) | 40 80 
Mellgren & Ost (1972) | 75 |150 300 
Miller (1966) 1 | 10 | 20 40 
Siegel (1969a) | 100 | 1300 
Wolff & Maltzman (1968) | | 20 | 40 
ылы, RE А MENS. Em ا‎ 


Nole. Numbers in the columns refer to n-nonreinforced preexposure. Underlined numbers signify failure to obtain LI; number? 
not underlined signify success in demonstration of LI. E 


exposures, suddenly 5 of 6 groups from 5 Days. Siegel (1970), using rabbits and the 
different experiments show inhibitory ef- conditioned eyelid response, found the ust? 
fects. This consistency is all the more latent inhibition effect as a function of pre 
surprising when one remembers the diver- exposure but no significant differences as a 
sity of the 12 studies in both species and function of time of delay, O and 24 hours: 


learning paradigm. A study by James (1971), using 0-, 305, an 
TE 60-minute time of delay and a conditione 
Stability of the Latent Inhibition Effect— effect response procedure, provides suppo! 
Temporal Duration for Siegel (1970). Several studies pet 
Temporal duration is defined as the time the suppression technique with a ue 
between the last nonreinforced preexposure delay between nonreinforced preexpost? ml 


and the beginning of acquisition. One of conditioned stimulus-unconditioned pum 

the few animal studies which did not obtain Pairing and still report marked latent ? 967: 

а latent inhibition effect used such a 24-hour Чоп effects (eg. Carlton & Vogel, 

peel (Suboski et al, 1964). Several Lubow & Siebert, 1969). 

studies have followed whi speci in- 

vestigated the role of mul dm LE Stimulus Specificity of Latent = ‘ng th 

bow et al. (1968), using rabbits and a An important factor for explains | of 

‚ше Pinna response, with 0- and 24- latent inhibition effect is the knowle au 

esed Fur the delay variable to be whether nonreinforced реони a ge 

with no only in higher order interactions — to-be-conditioned stimulus results Ф 
nreinforced preexposure, Blocks and eral decrement in subsequent per 


hibition 


or whether the inl 
only to that stim 
exposed. The stuc 
each indicate unce 
inhibition effect is stimulus specific (Carl- 
& Vogel, 1967, Experiment 2; Lubow 
& Moore, 1959; 
Siegel, 1969a). 
There is another aspect to stimulus speci- 
ficity, the place of preexposure in relation 
to place of testing. A series of studies by 
Anderson and his 


ton 


tioned sup 
designed t 
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übitory effects are limited 
lus which has been pre- 
lies performed in this arca 
{uivocally that the latent 


Schnur & Ksir, 1969; 


associates, using condi- 


pression of bar pressing, were 
9 test the effects of place pre- 


exposure, conditioning, and testing on the 
Strength of fear 
inhibition (Anders 
et al., 1969a, 1969b). It appears that latent 
inhibition is increased when the place of 
Preexposure is the same as the place of test- 


ing, 


However, ] 


üchieved when the 
different (May et al., 1967), 

Grant and Young (1971) demonstrated 
that preexposure to the test apparatus facili- 
tates subsequent | 
response as compared to a Sroup not having 
prior experience with the apparatus. On 
the other hand, preexposure to the condi- 
tioned stimulus wit 
11 the same avoidance acquisition scores as 
the nonpreexposed-apparatus group. 
Mellgren and Ost 

; using a g0/no-go discrimination, report 
а similar finding. 
Чиге for demonstrating latent inhibition is to 
Preexpose both the 


for 


Sure groups 
‘tre group to 


lat 


Conditioned sti 


the decremen 


i Tonounced than 


efr. 


Expr, 


eng partially 


1° assertion th 


conditioning and latent 
on et al., 1968; Anderson 


atent inhibition can be 
two places are distinctly 


earning of an avoidance 


hin the apparatus resulted 


(1972, Experiments 1 & 
Since the typical proce- 


n-nonreinforced preexpo- 


and O-nonreinforced preexpo- 
the apparatus, it may well be 


tal learning effects from 


mulus preexposure are more 


is currently recognized, 


masked by the facilitative 
Sets of the apparatus preexposure. 


NATIONS OF LATENT INHIBITION 


Tap; 
"ituation of the 


Orienting Response 


at acquisition of a condi- 


Tesponse js facilitated by the presence 


[3 orienting re 
Nüssian literatu 


flex is supported in both 
re (Sokolov, 1963; Voro- 


nin, Leontiev, Luria, Sokolov, & Vino- 
gradova, 1965) and the Western literature 
( Maltzman, 1907: Maltzman & Raskin, 
1965; Wolff & Maltzman, 1968). 1f learn- 
iug is facilitated by the presence of the 
orienting reflex, then any procedure that 
reduces the magnitude of the orienting reflex 
should also decrease the effectiveness. of 
learning. For example, habituation of the 
galvanic skin response to a list of words 
prior to conditioning reduces the level of 
galvanic skin response conditioning per- 
formance to a related word (Wolff & Maltz- 
man, 1968), 

If latent inhibition is the result of non- 
reinforced preexposure extinguishing the 
orienting reflex, any condition which re- 
activates the orienting reflex should restore 
normal conditionability, Schnur (1971), in 
two experiments, using rats and the condi- 
tioned emotional response paradigm, tested 
this deduction. In both cases the hypothesis 
Was not confirmed. 


Selective Filters 


Schnur (1971) also tested the Mackintosh 
(1965) derived latent inhibition explanation, 
The prediction is that nonreinforced preex- 
posure to one element of a later-to-he- 
compounded stimulus should enhance con- 
ditioning to the other element relative to 
a compound stimulus where both elements 
are novel. However, even if the position 
is proven basically correct, it would only be 
applicable to those few latent inhibition 
Studies that used within-subject designs, 
Both experiments reported by Schnur 
(1971) indicate that preexposing one condi- 
tioned stimulus element of a compound does 
not enhance conditioning to the other ele- 
ment as would be predicted by selective at- 
tention theory. Sutherland and Mackintosh 
(1971, p. 68) also evince a puzzlement about 
how to handle the latent inhibition data, 


Competing and Complementing Responses 


A different approach to finding ап ex- 
planation for latent inhibition is to look at 
overt responses occurring during the non- 
reinforced preexposure period which are 
elicited by the to-be-conditioned stimulus 
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and which may either complement or inhibit 
conditioning performance. Lubow and 
Moore (1959) suggested that latent inhibi- 
tion was a result of learning a response to 
the nonreinforced preexposure stimulus that 
was incompatible with a later-to-be-condi- 
tioned response. However, a test of this 
hypothesis using leg flexion and extension 
in sheep and goats failed to confirm the 
predictions. Likewise, Lubow et al. (1968) 
showed that latent inhibition was not the 
result of the habituation of alpha responses 
during nonreinforced preexposure. 


r z sje 
Conditioned Inhibition 


Rescorla (1971) defines conditioned in- 
hibition as a “stimulus which comes through 
learning to specifically interfere with excita- 
tion [p. 77]." He suggests dual testing 
criteria for identifying the presence of a 
conditioned summation with a 
known excitor and retardation of the devel- 
opment of a conditioned reflex, 

In the Rescorla (1971) study, the first 
experiment confirmed the earlier findings 
that preexposure retards subsequent condi- 
tioning of suppression. However, the sum- 
mation test for nonreinforced preexposure 
effects failed to provide evidence for condi- 
tioned inhibition. 

Rescorla (1971) also suggested that if 
conditioned inhibition occurred to the pre- 
exposed stimulus, then a-nonreinforced pre- 
exposure should interfere with any subse- 
quent learning to that stimulus, not only 
in excitatory conditioning but also in inhibi- 
tory conditioning. This was tested by trans- 
ferring from a latent inhibition procedure to 
a conditioned inhibition procedure. Evi- 
dence for nonreinforced preexposure-pro- 
duced conditioned inhibition was not found. 
A related study by Reiss and Wagner 
(1972) provides additional support for the 
independence of latent inhibition and condi- 
tioned inhibition. These conclusions are 
buttressed by the results of the three studies 
demonstrating latent inhibition within the 
£0/no-go discrimination paradigm ( Halgren 
| see Footnote 3]; Mellgren & Ost, 1972; 
oe zum. 91). In these studies, 
Ee di s stimulus which was later 

stimulus for inhibiting 


Tes 
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sponding resulted in an increased responding 
to S3 as compared to a O-nonreinforced pre- 
exposure group. A conditioned inhibition 
explanation of latent inhibition calls for de- 
creased responding. 


Conc. 


SIONS 


The latent inhibition phenomenon is a 
robust one, appearing in a variety of spectes 
and ta: A number of variables have 
been found to have orderly effects on latent 
inhibition, particularly n-nonreinforced pre- 
exposure. Nevertheless, a review of the 
several explanations of latent inhibition 1n- 
dicates that none has strong support. W hat 
remains is an explanatory residue which 
currently goes by the name of stimulus 
salience or attention decrement. Indeed. 
there is some support for the idea that non- 
reinforced preexposure effects a central at- 
tentionlike process. Carlton (1969) reports 
that scopalomine administered before the 
preexposure sion of a conditioned emos 
tional response procedure results in subjects 
having reliably higher conditioning scores: 
They exhibit more suppression in the test 
phase than subjects receiving nonreinforce 
preexposure without the drug. Similar T€ 
sults have been obtained by Oliverio (1968) 
with mice. Scopolamine evidently reduces 
the amount of habituation that can accrue 
from repeated stimulation. This anticholi 
ergic effect, the attenuation of habituation, p 
conversely the maintenance of attention, has 
been well documented for a variety of Я 
nomena other than latent inhibition (Carlto™ 
1969). 

Ackil et al, (1969) have added furt 0 
support for the central attentional wh 
n-nonreinforced preexposure. Three ТО" 
of rats were given either hippocampa b 
sions, neocortical lesions, or no xw. 3 
These main groups were subdivided Mare 
30-nonreinforced preexposure and he ne 
inforced preexposure аш, ко each 
cortical lesioned and nonlesionec P ноп 
demonstrated the usual latent inhibi 


phe- 


het 


ure 
nnne тоб! 

fect in an avoidance conditioning promised 

However, for the  hippocampe form 


T мар 
iffere in 0^ 
groups there was no difference be he no 
= H Fc Ti 5 sing 
ance between those subjects y m rec ivit 
reinforced preexposure and thos 


~ 


» 1 


- 


«Мга гт, 
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30-nonreinforced preexposures. Other stud- 
ies, (see Douglas, 1967 ; Kimble, 1968), have 
demonstrated that the hippocampus plays an 
important role in the modulation of atten- 
tion, and, specifically, that hippocampectomy 
results in the inability to attenuate atten- 
tion-like responses to repeated stimulation. 

Since the effects of these centrally operat- 
ing drugs and lesions are widely regarded 
as resulting in a failure to produce atten- 
tion. decrements and the same drugs and 
lesions also prohibit the development of 
latent inhibition, it is suggested that during 
nonreinforced preexposure, for the normal, 
nondrugged, nonhippocampectomized sub- 
ject, there is a centrally mediated decrement 
of attention to the preexposed stimulus, and 
that this decrement is responsible for the 
subsequent latent inhibition effect. 

The explanations of latent inhibition that 
were examined emphasized cither a learning 
or an attentional mechanism. The learning 
approach sought to discover relatively gross 
motor responses associated with nonrein- 
forced preexposure such as leg flexor and 
extensor antagonism, or to look for condi- 
tioned inhibitory effects Where the stimulus 
acquires general inhibitory properties which 
effect the organism's responding to not only 
that stimulus but to other stimuli with which 
it might later be compounded. Attentional 
explanations of latent inhibition simply con- 
ferred a degree of reduced stimulus saliency 
аз а result of nonreinforced ргеехроѕиге. In 
Seneral, this type of explanation is congruent 
With the data from the stimulus compound- 
ing studies of Reiss and Wagner (1972), 
Rescorla (1971), and Schnur (1971), and 
Meets the requirements of stimulus speci- 
ficity, However, it does not suggest a mech- 
‘nism by which nonreinforced preexposure 
Teduces salience. Furthermore, it does not 
attempt to deal with the facts of generaliza- 
tion of latent inhibition (Siegel, 1969b), 
and its relative stability over time (ешр. 
Lubow & Siebert, 1969; Mellgren & Ost. 
‚ 7) but susceptibility to countercondition- 
Slike Procedures (Carlton & Vogel, 1967: 
тайы, 1971). These latter considerations 


to the need for a model that combines 
Ntion 


al and learning constructs. 
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THEORIES OF DREAM FUNCTION 
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Some contemporary and psychoanalytic theories of dream function are reviewed. 
Particular attention is given to Jung's model of personality and theory of dream 
function, a dynamic, open-system approach that stands in contrast to Freud’s 
mechanistic, drive-reduction model. Contemporary theories tend to focus on the 
function of environmental mastery, viewed from one of three perspectives: (а) 
problem solving, (b) information processing, or (c) ego consolidation. Only a few 
have gone beyond environmental control to consider creative functions of the 
dream. Most contemporary theories are at least partially supported by data, and 
many are not mutually exclusive, dealing with different processes or proposing 
similar processes in different-sounding languages. Some dream theories do differ 
from others in underlying model, in scope, or in the degree to which nonrational 
processes are admissable as data. Jung's approach has much to add to contemporary 


dream theory, particularly in making room for creative and nonrational processes, 
as well as in the specific proposition that dreams function to balance and complete 


waking consciousness. 


Many answers have been suggested to the 
question of what function dreams serve in 
man's psychology. In this century, the most 
influential theory has been Freud's and his 
views have given laboratory psychology a fruit- 
ful source of hypotheses and debate. The main- 
stream of American scientific thought in this 
area has grown directly from laboratory re- 
search but is richly sprinkled with Freudian 
theory together with a Darwinian emphasis 
on adaptation as environmental mastery. 
Jung's approach to dream function, very differ- 
ent from Freud's and perhaps more compatible 
with current research findings, has remained 
outside the mainstream and is virtually un- 
known to American psychology. This paper 
aims to correct the omission, while providing 
an overview of psychoanalytic and contem- 
porary thought about the function of dreams. 

Freud (1953) believed that dreams serve a 
dual, compromise function. According to his 
theory, unconscious, instinctual drive energy 
pushes for discharge, moving toward the ex- 
Pression of a consciously unacceptable impulse. 
The reduction in conscious restraints charac- 
teristic of sleep permits a symbolic, disguised 


dream expression of the repressed wish. The 
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overt (manifest) content he dream repre- 
sents a compromise between the instinctua 
forces (latent content) striving for expression, 
on the one hand, and the repressive forces of 
consciousness, on the other. Freud assumed 
that the energy pushing for action would 
awaken the sleeper were it not for the dream 
which, through symbolic discharge, permits & 
return to sleep. Thus, the dream is seen 45 
serving the biological function of preserving 
sleep, as well as the psychological function 0 
discharging an unacceptable wish that might 
otherwise burst destructively into waking lito- 

Various aspects of Freud's dream theory 
have been reviewed from the point of aed 
contemporary dream research (Breger, eth 
Fisher, 1965; Foulkes, 1964; Hollander, 19 of 
Jones, 1968, 1970; Trosman, 1963), and ile 
changes in the philosophy of science.” W = 
conclusions vary, it is generally agreed that к 
least some aspects of the theory must be “Ж 
carded or revised. With respect to dream E 3 
tion in particular, the sleep preservation = ir 
untenable and the underlying model ud 
which the wish-fulfillment theory rests 
quires extensive revision. 

It must be emphasized that the р 
of Freud's theoretical model rests ж 
tinction between manifest and laten 
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content, that is, the assumption that meaning 
is concealed rather than revealed by the dream. 
Granting the puzzling, chaotic, and foreign 
flavor of much dream content, it is more 
straightforward to assume that the dream is 
an expression lacking the discriminative order 
imposed by conscious (“secondary process”) 
thought, and which must therefore be ordered 
and decoded if it is to be understood within 
the framework of normal adult thought. 
Whether dreams disguise, reveal, or are mean- 
ingless must remain a question of faith, but 
most theorists, including many whose work 


- builds upon Freud’s (e.g., Adler, Eriksen, Jung, 


Maeder, and Silberer) incline toward the view 
that they reveal. 

Despite the many problems inherent in his 
theoretical formulation of dream function, 
Freud’s far-reaching work has provided a 
fruitful basis for many of the contemporary 
theories discussed below. Psychologists steeped 
in the Freudian tradition have generated 
theories compatible with those arising from the 
laboratory alone, generally moving away from 
wish-fulfillment and drive-discharge functions 
toward an emphasis on adaptation and ego 
synthesis. Forerunners of this trend are to be 
found among Freud’s original psychoanalytic 
group and, indeed, can be discerned in Freud’s 
own brilliant interpretative work which, as 
Jones (1970) has emphasized, often ranges far 
from the letter of his theory. 

A large group of investigators approach the 
dream as essentially continuous with waking 
thought. While specifics of the theories vary, 
each in its own way attributes to the dream 
the function of facilitating some aspect of the 
dreamer’s mastery over his environment. 
Environmental mastery is usually approached 
from one of three points of view: (a) problem 
Solving, (b) information processing, or (c) ego 
Consolidation. : 

The problem-solving view of dream function 
Was already important to some of Freud's 
Carly followers. Maeder (1913), while he be- 
lieved with Freud that dreams have a drive- 
discharge function, also saw them as tentative, 
Practice attempts to solve problems. Adler 
ай Saw the dream as an oram Us ja 
| ea characteristic €— uh 
as a Solved reality problems and, with Mae E 

an attempt to meet and solve these problems. 
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In addition, Adler believed that dreams arouse 
emotions that push the dreamer toward prob- 
lem solution. However, he felt that the solu- 
lions embodied in dreams are self-deceptive, 
constituting a flight from realistic coping with 
life. 

Ullman (1959, 1960, 1962) has built upon 
Adler's theory but considers that dream con- 
tent itself constitutes an adaptive contribution 
to problem solving, rather than a flight from 
reality. He feels that the dream expresses 
symbolically the connections between a present 
problem and relevant aspects of past experi- 
ence; it permits a fuller assessment and thereby 
better solution to the problem than is available 
to waking consciousness alone. Ullman also 
looks at the adaptive value of the dream for 
species survival, assuming that the apparent 
lightening of sleep associated with the dream 
state originally evolved out of the necessity for 
primitive man to arouse periodically in order 
to survive physical danger. Dream images 
form an assessment of currently disturbing 
life situations and, depending on the outcome, 
the dream may either facilitate full arousal or 
promote a return to deeper sleep. 

Klein (1967) also maintains a close relation 
between daytime thoughts and the dreams of 
the night, seeing dreams as one mode of carry- 
ing on a series of ideas that may have been set 
in motion during the day. There is a certain 
amount of interchangeability among thoughts, 
fantasies, dreams, and actions in a “series of 
ideomotor events” that, once set in motion, 
pushes for completion in and of itself, not in the 
service of drive reduction. Building on this 
model, Fiss (1969) conceptualizes the need to 
dream as a need to complete unfinished tasks. 
Unresolved (incomplete) cognitive-emotional 
tasks constitute problems which continue 
toward completion during the night. 

Pearlman (1970), while he, too, believes that 
dreams play a role in solving problems, dis- 
tinguishes between their role and that of wak- 
ing thought. He points out that intellectual 
insight is different from the emotional kind of 
"knowing" that makes an insight a part of 
oneself and proposes that dreams function 
specifically to facilitate the emotional assimi- 
lation that gives adaptive reality to a problem 


solution. . 
Breger (1969) emphasizes the problem- 
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solving aspect of dreams for children in par- 
ticular, arguing for a developmental function 
facilitating the child's efforts at mastery. He 
follows Griffiths (1935) in pointing out that 
fantasy is the natural mode of thought in 
early childhood and the primary mode through 
which children work through emotional prob- 
lems. Like Piaget, Breger views dreaming as 
an assimilative form of thought which allows 
free "experimentation" with problem situa- 
tions and out of which emerge “fantasy 
programs" for creative new solutions. 

In another paper, Breger (1967) applies the 
problem-solving approach to an information- 
processing model and refers it to the solution 
of affective-personal problems in general, over 
and above those of child development. He 
hypothesizes that in dreams, current, affec- 
tively arousing problem situations are com- 
pared and “tested for fit” with various past 

"programs" that have served more or less 
satisfactorily to resolve earlier conflicts. Al- 
though couched in different language, this is 
similar to Ullman's approach and also to the 
views of French (1952) and French and Fromm 
(1964), who see dreams as attempted solutions 
of "focal conflicts." Likewise Greenberg (1970) 
suggests that in dreams, past and present 
experiences of a similar emotional nature are 
brought together and "filed," allowing past 
experience to facilitate new adaptation and 
new experience to alter old modes of coping. 
Breger (1967) points out that the dream state 
has a number of advantages over waking con- 
sciousness for dealing with emotional material : 
Stored information is more readily available; 
associational processes are more fluid; the 
criterion of social acceptability is at a mini- 
mum; and a greater variety of means for mani- 
pulating symbols is available. The overall 
effect is a “creative opening up of the memory 
systems [p. 25 ]." 

, Shapiro (1967, 1968) proposes a somewhat 
different information-processing model. Sleep 
and dreaming are seen as parts of a total 
organismic process whereby the nervous Sys- 
tem comes increasingly to approximate a 

map of external reality as a result of experi- 
= With corresponding changes in attitudes, 

m im are two modes of reality 
fOughly to кесп n Which corresponds 

А econdary-process thinking, and 
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the "comparing," which approximates pri- 
marv-process thinking. Both modes of thought 
operate during sleep and waking but the 
former is more dominant during waking, the 
latter during sleep when new data are c pared 
for relevance with past data and future ex- 
pectations. A feedback system normally pro- 
duces alternation between the two modes of 
thought, both of which are necessary for com- 
plete processing of incoming experience. T he 
noxious effects of sleep and/or dream depriVa- 
tion result from the accumulation of un- 
processed data. - 

Hawkins (1966), in a still different variation 
of the information-processing theme, concelves 
the waking state as one of constant input, the 
dream state as one of sorting and storage 
required after a certain amount of input has 
occurred. At the physiological level the sorting- 
Storage process can be seen as central nervous 
system reorganization. Translated into psycho- 
analytic language, it is a process of ego re- 
cathexis but not drive discharge. 

In fact, a number of the theorists who pro- 
pose an environmental-mastery function ap- 
proach dreams via the concept of ego consol 
dation. Eriksen (1954) has spoken of the 
"identity-preserving function of the dream, 
a view that Jones (1962) has elaborated: 
According to Jones, the dream advances e£ 
synthesis through its integration of new 
periences from waking life into a proc 
whereby the dreamer's “nuclear crises" are 
resolved. Similarly, Castle (see Footnote 2), who 
follows Piaget in understanding the dream bs 
egocentric and assimilative thought, suggest? 
that dreams maintain ego continuity by Ue 
lating new material to the dreamer's em 
history, thereby allowing the integration of d 
acknowledged material which would otherw a 
remain unassimilated. Following the same d 
of thought, Witkin (1969) hypothesizes n 
the course of a dream, the dreamer’s m " 
with respect to the central problem 15 edi 
improve, putting him in a position of relativ 
more mastery than at the beginning ipii 0 
dream. Witkin emphasizes the dimension |e 
psychological differentiation as 4 пез Uf! 
equivalent of ego development Gum "n (ue 
published work by Eagle showIng e tive 
dreamer is more often represented = ad de- 
participant. (implying relatively Mo 
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velopment) in the dreams of more differ- 
cutiated persons. 

Lerner (1967) points out that, while differ- 
entiation is important, very little attention 
has been given to the related question of the 
"reintegration" of experience. Her view, an 
outgrowth of Rorschach's work, is that dreams 
contribute to ego development by functioning 
to reintegrate the body into fantas which 
the dreamer cannot act out in waking life. 
Body image is seen as the basis for ego and, 
following Rorschach, Lerner proposes that 
kinesthetic fantasy is required to maintain co- 
herence of body image, hence of personality 
organization. Immobility is required for the 
production of kinesthetic fantasy, a condition 
that is met by the dream state but rarely by 
other natural conditions. 

Some of the mastery-oriented views of dream 
function concern themselves also with its crea- 
live aspects insofar as creativity is a tool in the 
practical business of gaining control over the 
environment. Only a few contemporary theo- 
rists have approached the contribution of 
dreams to the creative process as a value in and 
of itself, adding something new to human life 
beyond the utilitarian function of survival and 
environmental control. 

Krippner and Hughes (1970) view infor- 
mation processing secondarily as an early func- 
tion of dreams, which in later life continues to 
employ a mode of thought that is character- 
istic of childhood mental processes. This mode 
rarely appears in adult thinking but is a vital 
aspect of creative thought. These authors cite 
studies showing that creative people tend to 
remember more dreams than noncreative 
people and to have more exotic dreams; they 
also mention a number of historical personali- 
ties whose discoveries, inventions, and/or 
artistic products were mediated by dreams. 

Piaget (1951) is noteworthy in his view that 
dreams, along with play, constitute a creative 
process of construction, an essential aspect of 
the equilibrative mechanism through which 
human cognitive development occurs. He 
recognizes the subjective, affective concerns 
that are maximally operative in dreams as no 
less important than environmental reality in 
the interaction between organism and environ- 
ment. Like play, dreams may occur in a 


| dynamically compensatory relation to en- 
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vironmental reality. Ironically, the views of 
Piaget and of Jung, who have been sharply 
critical of one another, are in many respects 
more similar than the thought of either of 
these two Swiss psychologists is to American 
or to Freudian psychology. 

Like Freud's, Jung's theory of dream func- 
tion is an integral part of a larger view of 
personality, although it differs quite radically 
from Freud's. So far as specific functions of the 
dream are concerned, Jung's view incorporates 
many of those described above. As Ellenberger 
(1970) has pointed out, 


-+-Jung maintains that the functions of dreams are 
manifold. They can express fears as well as wishes; 
they can give a mirror picture of the dreamer's actual 
situation; there are prospective dreams; others are 
creative, warning, or parapsychological [p. 716]. 


Jung does, however, propose an overriding 
principle of dream function within which the 
specifics operate. 

With the rare exceptions noted above, con- 
temporary theories of dream function are 
characterized by an exclusive concern with 
adaptation to the world of concrete, outer 
reality. Jung gives equal emphasis to the world 
of subjective, inner experience. Further, un- 
like both Freud and contemporary theorists, 
Jung retains a deep concern for the irrational 
side of man's nature. His attitude is that the 
irrational is, whether we like it or not, and any 
approach to psychology that tries to exclude 
it is unable to explain, or even to describe, a. 
great deal of human behavior. Further, through 
reductionism the richness of human life and 
values becomes lost. 

Jung observed, svstematized, and named the 
phenomena he observed but did not consider 
himself to be a theorist. Working from obser- 
vation rather than theory, he felt no necessity 
to fit later observations into earlier formula- 
tions, nor to document his inconsistencies. He 
said (1953), “I ... prefer the precious gift of 
doubt, for the reason that it does not violate 
the virginity of things beyond our ken [p. 8].” 
Such fluidity has obvious advantages, together 
with the disadvantage (among others) that it 
is difficult to become familiar enough with his 
thinking not to misrepresent it. Nevertheless, 
an attempt is made here to set down the broad 
outlines of a model of personality functioning, 
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including dream function, that emerges from 
and underlies his observations. Because Jung’s 
view is little known, it is described in some 
detail. 

For Jung a construct called *the uncon- 
scious” is central. Jung’s view of the uncon- 
cious is different from Freud's and the two 
must be distinguished. It is difficult to define 
the unconscious in a precise way. As Jung said 
in a filmed interview, “The trouble with the 
unconscious is that it is unconscious!” Looking 
at the unconscious is a little like looking at the 
back of one's own neck; it cannot be done 
directly, but only through an arrangement of 
mirrors, leaving doubt as to its "true" nature. 
It is particularly difficult to define in the con- 
text of a psychology that has only recently 
begun to emerge from an empty-organism view 
to which the notion of consciousness is hardly 
acceptable, without which the contrasting un- 
conscious cannot be defined. That is, the un- 
conscious comprises “the totality of all psychic 
phenomena that lack the quality of conscious- 
ness [ Jung, 1960a, p. 133].” 

Nevertheless, as a beginning we can say that 
the unconscious includes all those aspects of 
the total personality that are not under con- 
scious control. This includes not only what was 
once conscious and has been repressed, but also 
innate potentialities that have not vet become 
conscious. Obviously, at a superficial level 
what is unconscious varies from moment to 
moment depending on fluctuations in attention. 

Unconscious contents include, for example, 
forgotten or repressed memories, psychological 
representations of bodily functions of which 
we are not aware, and innate and acquired 
patterns of behavior. The primary mirrors in 
which these contents can be observed are 
affects and images. As the spontaneous activity 
perhaps furthest from actual or potential con- 
Sclous control, dream images are one of the 


best mirrors of the unconscious. As von Franz? 
puts it, 


Our concept of the unconscious is а borderline concept, 


Tha simply describe by this term a real though in- 
E M for +++ the reality of the unconscious is a 
"t ed reality, not demonstrable directly. You can 
y that it is what produces dreams, that it is that 
M.-L. von Franz 


Niklaus топ der Fl 
1957. (Mies) — ^ 


The dreams and visions of St. 
Zürich: C. G. Jung Institute, 


JANET DALLETT | 


“X” which arranges the meaningful order of the dream. 
With a certain technique of interpretation we (oin bring 
out of the dream a meaningful connection, so we « 
clude that there must be something which produces 
the dream. We have ... a certain amount of fact and 
extract therefrom a concept which we call the un- 
conscious [ p. 2-2]. 
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Although in their manifestations thev are 
usually mixed together, Jung classifies uncon- 
scious contents as of two sorts: personal and 
collective. The personal unconscious consists — 
partly of the Freudian unconscious, namely, 
all those contents that have been repressed 
because they are too painful or incompatible 
with conscious views to recognize. In addition, 
there are neutral contents and, more important, 
memories and parts of the personality which 
may be acceptable in themselves but have 
been repressed because they are not in accord 
with the image of himself that the individual 
has developed. 

The collective unconscious constitutes all 
those aspects of the psyche that are basic to 
the human condition, those psychological 
characteristics that belong to us by virtue of 
our biological structure. It is part of being 
human that most of us are born with two legs, Lu 
two arms, a cerebral cortex, and the capaelty | 


ме 


to walk upright. These and all other aspect? 
of our biology have certain psychological con- 
sequences and lead to typical modes of func- 
tioning and patterns of behavior that are Te 
flected in the affects and images of the collec- 
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nuclei of energy which give impulse and ogi | 
tion to thought, perception, and at the ia 
stinctual level, behavior. These energic n 

are called archetypes, which are 

the necessary a priori determinants of all 


processes. Just as his instincts compel man to 


dstence, s ‚ archetyp! 
ally human mode of existence, so the archet) mie 
тт da into specifically 


his way of perception and apprehension 1 
human patterns [Jung, 1960a, р. 133]. 
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may express itself and is not invariant, for it 
is almost inevitably colored by both cultural 
and personal determinants. The archetype 
bears no relation to the notion that acquired 
ideas can be inherited. The archetype is no 
more mysterious than the fact that the brain 
15 programmed to work in particular ways and 
not in others, with particular psychological 
consequences. For example, one of the most 
fundamental attributes of the psyche is the 
tendency to perceive the world in dualiti 
this and not that, black and white, male and 
female, high and low, I and not I, etc. It is 
difficult to conceive of the development of the 
discriminative, organized thought character- 
istic of human consciousness without this 
fundamental tendency as a starting point. 

It is notable that Breger (1967), searching 
for a model more compatible with known bio- 
logical reality than is Freud’s theory of psychic 

for a con- 


energy, has arrived at the necessit 
cept which very closely approximates that of 
the archetype: 


The difficulties stemming from the psychic-energy 
theory may be in large part avoided by assuming that 
what is biologically basic (^instinctual" in Freud’s 
terms) is the direction given to thought and action by the 
structure or organization of the nervous system. The 
ethologists take this approach when they posit 
structural mechanisms underlying the “fixed action 
patterns" that comprise the *instinctual behavior" of 
certain birds and fish. Such structures have evolved, 
through the process of natural selection, because of the 
adaptive advantages they confer on the species possess- 
ing them. ... These guiding structures are embodied 
in the brain and nervous system and have evolved to 
their present state of complexity over long periods of 
time, just as have the other biological systems of the 
body. ... It is worth stressing that even in the case of 
the relatively fixed action patterns, it is typically goals 
or environmental achievements, and not rigid sequences 
of motor acts, that are built in. ... These sorts of 
guiding structures ... provide the direction of ongoing 
activities that is motivation [pp. 11-14]. 


As is evident from this brief overview, Jung’s 
view of the unconscious is far from Freud's 
seething cauldron. While recognizing the dark, 
Negative aspects, Jung sees the unconscious 
m addition as the often untapped source of 
Positive, adaptive, and creative potential; of 
Man’s highest possibilities as well as his lowest. 
xx 85 view, the personality can be уге 

› as dreams sometimes represent it, as 
circle within which consciousness 15 
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contained as a very small island. The ego, the 
"complex of identity" (see Whitmont, 1969), 
is the center of that tiny island of conscious- 
ness, but not of the whole personality. 

It is not unusual for the ego to think that 
consciousness comprises the total personality 
and that the vast area outside consciousness is 
either nonexistent or insignificant. The ego- 
centric view either ignores manifestations of 
the unconscious, reduces them to the point of 
view of the ego, or projects them outward, 
explaining them as forces acting from the out- 
side. Jung sees this as a drastically limited 
and limiting approach that reduces man to an 
infinitesimal portion of his potentiality. 

Jung observed that the content of individual 
dreams appears to bear a relation best described 
as compensatory to the attitude of ego-con- 
sciousness. To understand what he meant by 
compensation, one must bear in mind the 
picture of the total personality. The dream 
presents a point of view from outside conscious- 
ness but from within the whole personality. 
In simple instances the dream content may be 
directly complementary to the ego's attitude; 
for example, if the ego says "I am white," the 
dream may imply that the ego is in fact black. 
Usually compensation is more subtle, however, 
and a general statement of the principle is 
that the dream confronts the ego with what is 
now most necessary to bring its attitude into 
accord with the reality of the whole personality, 
to restore intrapsychic balance. As Jung 
(1960b) puts it, compensation implies *balanc- 
ing and comparing different data or points of 
view so as to produce an adjustment or a 
rectification, and ... a self-regulation of the 
psychic system [pp. 287/]." So long as the 
aims of the ego remain in relative harmony 
with the whole, the compensatory process 
appears to proceed automatically, without 
conscious attention from the ego. However, if 
the ego persistently behaves in ways that do 
violence to the whole, the compensatory 
activity of the unconscious becomes extreme 
and may result in neurotic symptoms, psycho- 
genic illness, moods, outbursts of affect, 
accidents, nightmares, etc. All such symptoms 
can be understood as compensatory responses 
to an ego attitude that has become too one- 
sided. When they arise, serious attention to 
ims can provide clues to the inner or outer 
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changes required to restore more balanced 
functioning. 

Jung did not believe that the dream “tries” 
to conceal anything. Rather, he saw the dream 
as the best possible symbolic (usually not 
literal) expression of a situation. The language 
of unconscious symbols is different from that of 
rational, verbal, or linguistic symbols, however, 
and a good deal of work is required to under- 
stand a dream in its own terms. Jung (1966a) 
says of this: 


The true symbol differs essentially from [the sign or 
symptom], and should be understood as an expression 
E intuitive idea that cannot yet be formulated in 
any other or better way. When Plato, for instance, 
puts the whole problem of the theory of knowledge in 
his parable of the cave, or when Christ expresses the 
idea of the Kingdom of Heaven in parables, these are 
genuine and true symbols, that is, attempts to 5 
something for which no verbal concept yet ex If 
we were to interpret Plato's metaphor in Freudian 
terms we would naturally arrive at the uterus, and 
would have proved that even a mind like Plato's was 
still stuck on a primitive level of infantile sexuality. 
But we would have completely overlooked what Plato 
actually created out of the primitive determinants of 
his philosophical ideas; we would have missed the 
essential point. . .[p. 70]. 


pre 


Dreams can, and sometimes do, refer to 
outer situations and people, and they are often 
understood as such on what Jung calls the 
objective level. Often, even when a dream 
contains people and situations from the outer 
world, it is more meaningful to understand it 
in а completely inner way, on the so-called 
subjective level. Taken subjectively, every- 
thing and everyone in the dream embodies 
an aspect of the larger personality. For ex- 
ample, a very unsociable person may dream 
about a life-of-the-party friend who embodies 
his own unlived sociability. Whether it is 
viewed objectively or subjectively, every 
dream must be seen in the context of the life 
Situation and conscious attitudes of the 
dreamer as the principle of compensation im- 
plies. It is impossible to be certain of the 
meaning of a dream out of context. 
ee Jung, then, individual dreams 

l omplex w ays to compensate ego- 
e cannot say “in order to 
scription of le bae У газ ндн 
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sents an image in much the same way as nature 
allows a plant to grow, and it is up to us to 
draw conclusions [ Jung, 1966b, p. 104].” е 

When dreams are observed and "related to* 
(valued, with an attempt to understand them) 
over a period of time, several things happen. 
If the ego-attitude has been extremely one- 
sided, it may move closer to the center of the 
whole, in which case neurotic symptoms 
generally disappear. The conscious personality 
also broadens, so to speak, as more and more 
of the inner possibilities are accepted and lived. 
And one begins to notice, longitudinally, & 
second function of dreams. The specific com- 
pensations of each night begin to reveal ы 
complex pattern that appears as if all the 
conflicting and diverse potentials of the total 
personality “wanted” to be realized in the life 
of the individual in what Jung calls the process 
of individuation. It is as if the point-counter- 
point between ego and unconscious were to- 
gether moving toward some ideal goal of the 
fulfillment of the whole man. Thus Jung infers 
a developmental function of dreams which 
moves, through the whole of life, toward the 
realization of the total personality. Dement 
(1966) has said of rapid eye movement (REM) 
sleep, 


never before in the history of biological research ha à 
much been known about something from a descripti Ve 
point of view, with so little known at the same UM 
about its function. Yet to doubt that the REM es 
has a major and vital biological destiny to fulfil e 
to fly in the face of well-established evolutionan 
principles Гр. 136]. ~ 


Likewise, given the universality of dreaming 
there can be little doubt that dreams € 
selves play some vital role in man's р5У© " 
economy, vet little factual basis has emerge 
to warrant the choice of one hypothesize“ 
function over another. Empirical evidence 0 
been marshalled in support of each of the F И 
temporary theories. Мапу аге not wr. 
exclusive but differ primarily in emp e 5 
each illuminating a different aspect of the nm 
process. Problem-solving, wfermatu pu into 
ing, and ego-consolidation theories power 
one another and often appear to be y Klin use 
similar or identical processes, despite 
of different languages. 

There are a few dimensions upon W ү? 
theories of dream function differ subst: 


ing 
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fror others. One such dimension is that of drive 
charge as opposed to balance or dynamic 
equilibrium as an underlying model. Most 
drive-discharge approaches, for which Freud’s 
theory is the prototype, tacitly conceive the 
organism as a relatively static, mechanic lly 
closed system that is acted upon from within 
and without and that moves primarily to rid 
itself of stimulation in order to return to its 
original state. Most balance theories, for which 
Jung’s is the psychological prototype, view 
the organism as a dynamic, biologically active, 
open system that seeks and interacts with 
stimulation in such a way as to take in, or 
integrate, the stimulus in order to arrive ata 
new and different state of balance. It is note- 
worthy that Freud, and also Dement (1968) 
who has been one of the primary contemporary 
proponents of drive discharge, initially ap- 
proached the dream via abnormal psychological 
states. If we begin instead with the normal 
condition, it is more natural to think, as 
Fisher (1965) suggests, of a balance mechanism 
maintaining a steady state of dreaming, with 
imbalances associated with disturbance. 

A second respect in which dream theories 
clearly differ is in scope. It is a question of 
aesthetic preference whether one chooses to 
work with segments or wholes; whether one 
prefers the sense of cer ainty that can come 
from focusing on a limited area or would 
rather tolerate the ambiguity involved in 
working with more inclusive concepts. For 
those who prefer to think on the grand scale, 
Jung's principles offer а virtually untapped 
means of encompassing a great deal of existing 
data, not only on dreams but also in other 
areas of symbolic expression like fantasy, play, 
drama, art, mythology, and religion. 

A third real difference among theories is the 
extent to which subjective evidence and non- 
rational processes are admissable as data. With 
the exception of the few who have looked at 
the creative function of the dream, contem- 
porary theorists have stayed very close to a 
concept of function that derives from bio- 
logical adaptation to the outside world. The 
conceptual leap required to think in terms of 
inner necessity and subjective function does 
not come easily. Questions like “Where is it 
leading, psychologically?” and “What is its 
psychological meaning or value?” have often 
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been confused with metaphysical questions 
about a prime mover with a purpose outside 
human psychology. This confusion is unneces- 
sary and leads away from many of the central 
problems of psychology, the points at which 
man differs from other animals. Because 
scientific psychology, which must exclude non- 
rational methods, tends unnecessarily also to 
exclude nonrational data, we split into radical 
positivists on the one side and radical hu- 
manists on the other, each leaving out an 
essential aspect of a complete psychology. 
Jung has made room for nonrational data in 
his nevertheless. rational theory of dream 
function. 

Within certain very broad limits, our theories 
reflect our values which in turn affect the 
values of those who turn to us as experts. 
American psychology has grown up in an 
atmosphere of materialistic and utilitarian 
values which has had no small effect on our 
preference for materialistic and utilitarian 
theories. If we believe that control of the en- 
vironment is the only important human psy- 
chological value, then we will consider func- 
tional only processes that promote environ- 
mental control. If we value artistic creation, 
harmony with our neighbors, or a sense of 
meaning in life as important goals in and of 
themselves, then psychological processes lead- 
ing to these ends will be considered functional. 
Dream psychology is one area in which it is 
difficult for long to close the door to the latter, 
specifically human functions. 
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